Monitoring for disturbance of optical fiber

ABSTRACT

Problems of excessive fading in systems for monitoring single-mode optical fiber for physical disturbances are addressed by launching into the fiber polarized light having at least two different predetermined launch states of polarization whose respective Stokes vectors are linearly-independent of each other; downstream from the first location, receiving the light from the fiber; analyzing the received light using polarization state analyzer means having at least two different analyzer states of polarization that are characterized by respective Stokes vectors that are linearly-independent of each other and detecting the analyzed light to provide corresponding detection signals; deriving from the detection signals measures of changes in polarization transformation properties of the fiber between different times that are substantially independent of said launch states and said detection states; and, on the basis of predefined acceptable physical disturbance criteria determining whether or not the measures are indicative of a reportable physical disturbance.

CROSS-REFERENCE TO RELATED APPLICATIONS

n/a

TECHNICAL FIELD

This invention relates to systems and methods for monitoring for physical disturbance of an optical fiber, and is especially applicable to monitoring single-mode optical fiber using polarized light.

BACKGROUND

There are many applications for monitoring an optical fiber to detect physical disturbances, including monitoring for attempts to access data signals being transmitted over optical fiber, monitoring for attempts to intrude upon a zone protected by an optical fiber deployed around it, monitoring of vibration in machinery with which the optical fiber is associated, monitoring pipelines along which the fiber is distributed, and so on.

In the context of monitoring data signals transmitted over optical fiber, for example, it is well-known that an optical fiber can be tapped, and light extracted, without interrupting the transmission of the data signal along the fiber. One approach is to bend the fiber to such a degree that some light leaves it and can be diverted to a suitable receiver. Such a bend may introduce a loss of up to 1 dB of the transmitted signal, and optical data networks have intrusion detection systems capable of detecting this power loss and providing an intrusion alarm.

An alternative, lower-loss approach to tapping an optical fiber involves carefully grinding away the outer part of the fiber in order to permit evanescent-wave coupling to an adjacent tapping waveguide. The power loss resulting from this kind of tap might be as low as 0.04 to 0.004 dB. Such a low loss might be impossible to detect reliably by intrusion detection systems which use power measurement methods.

An alternative class of detection techniques is predicated upon the fact that physically disturbing an optical fiber will cause changes in the propagation characteristics of the optical fiber for polarized light. Typically, the physical disturbances will be asymmetric, so the refractive index for one state of polarization of the light will be altered differently from that for another state of polarization, causing relative phase shifts that will generally change the polarization state of a transmitted light beam that is composed of a mixture of these two polarizations. More particularly, in the various applications of the invention mentioned above, bending, compression, twisting or torquing of the fiber may cause stress and strain that induce varying amounts of birefringence (linear or circular), resulting, for example, in a linear polarization state becoming elliptical or rotating. Such polarization changes can be detected as indications of a relevant disturbance of the optical fiber.

Optical fibers carrying data usually are in a cable, sometimes with other fibers and either of the above-described tapping procedures would entail physically disturbing the cable to such an extent that it is viable to design a monitoring system with enough sensitivity to detect an abnormal change in the polarization of light transmitted by the fiber to be tapped, or by a separate monitored fiber in the same cable, and initiate appropriate action, such as triggering an alarm, even before the intruder actually gained access to the fiber to be tapped.

Although, on the face of it, such a detection procedure might seem straightforward, in practice it is difficult to differentiate between normally-occurring disturbances and abnormal or “reportable” disturbances which should be reported because, for example, they result from a genuine intrusion attempt. The difficulty arises because effects in the fiber's environment can cause optical changes with similarities to some of those caused by intrusions. Such effects might include the effects of wind on a suspended cable, “stick-slip” phenomena combined with expansion and contraction during temperature changes, vibration from nearby equipment, or traffic or construction near a buried cable, and so on.

U.S. Pat. No. 7,173,690 (Haran) discloses a perimeter intrusion detection scheme comprising an optical fiber disposed around the perimeter of a building or other structure, for example by attaching it to a fence or burying it in the ground. Haran injects polarized light pulses into the optical fiber and detects changes in Rayleigh backscatter for different polarization states to determine occurrence of a disturbance and its location.

Haran uses a polarization optical time-domain reflectometer (POTDR) to capture a number (2 to 1000) of POTDR traces, digitally filters them and averages the digitally-filtered traces to form a “reference” trace which represents the normal or “no disturbance” condition. Thereafter, subsequent POTDR traces are acquired and compared with the reference trace. This process may take a typical POTDR several seconds (if not minutes) in order to achieve the desired signal to noise ratio (SNR) on a backscatter trace that is used for determining whether handling has occurred. As the length of the monitored fiber increases, a better SNR of the system is required, and therefore more averaging time. In view of the time taken to acquire enough samples for analysis, this method is mainly suitable for detecting slow changes, and is poorer for characterizing activity such as bumps or rattling that are associated with preparations to tap a fiber or other intrusions that have faster changes characterized by variations at moderate acoustic frequencies.

Other known techniques, such as those disclosed in U.S. Pat. Nos. 7,142,737, 7,206,469 and 7,693,359 and United States published patent application No. 2006/0153491, all by C. R. Murphy et al., and U.S. Pat. No. 5,061,847 (Hazan et al.) may sacrifice the position resolution of OTDR methods, but they require less signal-averaging time and can therefore capture acoustic-frequency disturbances with a good signal-to-noise ratio. This higher frequency response widens the range of criteria that can be used for distinguishing reportable disturbances from other changes. These techniques entail launching polarized light into an optical fiber, and detecting light emerging from the fiber through a polarizing filter. The detection signal varies when the fiber is disturbed because a disturbance changes the transformation or rotation that the fiber enacts on the polarization of the light, so that, under the right conditions, the fraction that passes through the filter will change.

The methods of Haran, of Murphy et al. and of Hazan et al. are subject to the phenomenon of “fading”: the sensitivity of the detection signal to a disturbance in the fiber depends strongly on (i) how the polarization state of the light entering the disturbed region happens to align with the geometry of the disturbance, and (ii) how the change in the polarization of the light emerging from the fiber happens to align with the polarization analyzer. (Alignment of polarization states with polarization transforming devices and filters is different from ordinary alignment in physical space, but is a well known concept.) The alignment is generally unpredictable depending on how different parts of the deployed fiber happen to be bent, twisted or stressed; and it will vary as environmental factors change (such as the temperature of the optical fiber cable). Thus, the sensitivity to a particular disturbance will be unpredictable and will vary randomly over time, between substantially zero and some maximum value. This unpredictable and variable sensitivity can make it difficult or impossible to set a threshold that the observed magnitude of a disturbance must exceed to make it reportable; specifically that is low enough to ensure reporting of substantially all disturbances that should be reported, yet high enough to reject changes that should not cause reporting.

In their patent specifications, Murphy et al. use detectors with two different polarization analyzers at the output of the fiber, so that, when a polarization change occurs in the light that reaches the output, if it is aligned so that one detector sees no change at all, the other detector will see a change. The resulting sensitivity may vary, but at least it will not go all the way to zero for polarization changes that reach the fiber's output. However, the polarization of the light passing through the disturbed region may still be aligned with a physical disturbance in such a way that no polarization change is created to first order in the magnitude of the disturbance, so it is still possible for the sensitivity to go all the way to zero.

Of the three references cited above (Haran, Murphy et al. and Hazan et al.), only Hazan et al. address the problem of fading in a concerted way. Related work is published in their paper “Buried optical fiber pressure sensor for intrusion detection,” in Proceedings of the 1989 International Carnahan Conference on Security Technology, pp. 149 to 154, section 4.1.3. In this paper and their U.S. Pat. No. 5,061,847, Hazan et al. explain the causes of fading in detail, and disclose a way to mitigate the effects. Nevertheless, such mitigation is not entirely satisfactory.

Hazan et al. do better than Murhpy et al. by using more than one launch state of polarization (as well as using more than one polarization filter for detection). Thus, if the light due to one of the launch states, when it reaches the disturbed zone, is aligned with the disturbance in a way that produces no polarization change, the light from the other will be aligned in a way that does produce a change (and at least one of detectors at the output will see a change). This prevents the sensitivity from ever going all the way to zero, and thus makes a substantial improvement. Hazan et al. disclose several simple ways to combine multiple signals from different combinations of launch state and analyzer state (e.g. adding absolute values, or creating a logical “OR” of the alarms from two separate signals). However, although such ways of combining the signals (each of which has severe fading by itself) may reduce fading, significant fading is still present resulting in significant uncontrolled variations in sensitivity. Consequently, Hazan et al's. approach may not be entirely satisfactory, particularly in an application where it is not enough merely to reduce fading, but it is desirable to effectively eliminate it.

Hazan et al. and Murphy et al. specify rather explicitly what polarization states should be launched into the fiber, and what polarization filters or “analyzers” should be used at the detectors. Thus, Hazan et al. use a pair of linear polarization states at the input and another pair for polarization analysis at the output, with the second linear polarizer in each pair at 45 degrees to the first; or at least within the range of 30 to 60 degrees. In another case, they specify adding a circular polarizer. Murphy et al. use a pair of polarizers at detectors, one of which is linear and the second of which is either a circular polarizer or a linear polarizer at 90 degrees to the first. (In fact, it appears that this 90-degree angle merely provides redundant information.)

Such design constraints cause undesirable challenges in the engineering, specifying and manufacturing of a product. For example, some convenient ways to split beams to be divided between, and coupled to, different polarization analyzers, including coiled optical fibers used for coupling, may involve phase shifts that cause the combination of a linear polarizer with its coupling means to be tuned to some elliptical polarization state, and to have a relationship to a second polarizing filter that cannot be characterized so simply as a 45-degree angle between two linear polarizers. Similarly, splitting means comprising detector preceded by a circular polarization filter or analyzer, when combined with its coupling means, may turn out to be tuned to a state that is very different from a circular polarization state.

In summary, such previous polarization-based optical fiber disturbance monitoring systems experience significant fading, reducing their accuracy, consistency and reliability in assessing the magnitude of a disturbance, and hampering the ability to distinguish between reportable disturbances and disturbances that should not be reported. In alarm systems, this can either increase the frequency of false alarms or make it easier for an intruder to avoid detection. Such previous systems also place unnecessary restrictions on the polarization states launched into the fiber and on the filters or analyzers at the output end.

An object of the present invention is to at least mitigate deficiencies of such previous optical fiber disturbance monitoring methods and systems, or at least provide an alternative.

DEFINITIONS

In this specification, the meanings of certain terms are as follows:

“Stokes 4-vector” refers to a four-component vector comprised of the four Stokes parameters (S₀, S₁, S₂, S₃); “Stokes 3-vector” and “Stokes vector” both refer to a vector comprised of three Stokes parameters (S₁, S₂, S₃), or to a unit vector with the same direction, as will be clear from the context.

A stationary light beam generally has a “state of polarization” or “polarization state” that can be fully polarized (a “pure” polarization state), partially polarized (equivalent to a pure state mixed with unpolarized light in some proportion), or unpolarized.

“Partially polarized light” (unlike unpolarized light) has a dominant polarization state (the pure component) that can be characterized by a Stokes vector, even though the Stokes 3-vector may not include the full information on the degree of polarization that a Stokes 4-vector does.

“Polarized” includes fully polarized and partially polarized, and excludes unpolarized, in application either to light or to an optical filter.

A “polarization filter” can be polarized or can be a “null” filter that has no effect on polarization of light passing through it, while a “polarized filter” or a “polarizing filter” does have an effect.

The term “polarizer” may designate any polarization filter, including a high-extinction device, an intermediate or low-extinction device, or a “null” filter that does not restrict the light passing through it on the basis of the light's polarization state.

An “analyzer” is the effective polarization filter between the output end of a fiber under test and a detector. It comprises any explicit polarizer or polarization-altering device that is placed in the path, plus any polarizing effects of minors, beamsplitters or other components.

The “extinction ratio” of a polarization filter or an analyzer is the ratio of the maximum fractional transmission for all pure polarization states of light to the minimum fractional transmission for all pure polarization states of light. The “extinction ratio” of a light beam is the ratio of its maximum fractional transmission for all perfect polarization filters (filters with infinite extinction ratio and no loss for an optimally aligned pure state) to its minimum fractional transmission for all perfect polarization filters

An “analyzer state of polarization” is associated with each analyzer, and may also be called the “polarization state” of the analyzer, or the “analyzer state”. It is the state of polarization defined by the four components of the top row of the analyzer's Mueller matrix, taken as the four components of a Stokes 4-vector. This characteristic state of the analyzer has an extinction ratio that is the same as the extinction ratio of the analyzer. Whether this extinction ratio is high or low, so long as it is greater than 1, the analyzer can be characterized (at least partly) by a Stokes 3-vector, and it may be called a “polarized analyzer”. If the extinction ratio is 1, the analyzer is similar to a null (unpolarized) filter and it cannot be characterized by a Stokes 3-vector.

A “launch state of polarization” is the polarization state of a light beam that is launched into a fiber. It will generally be at least partially polarized in the examples presented here, and can therefore be characterized by a Stokes vector.

“Change space” refers to a notional three-dimensional space which is used as one way to describe changes in the transformation that an optical fiber does on the polarization state of the light that it carries

The word “point”, when used in reference to data, can apply flexibly to different kinds of data point, such as a raw data point comprising a set of the signal values/samples from several detectors as measured at a particular time, or an averaged data point, comprising a set of averages of the signals in raw data points measured over some time interval. The meaning will generally be clear from the context

The word “block”, when used in reference to data or data points, can apply flexibly to different kinds of block, such as a set of raw data points comprising signals from several detectors as measured at a number of sequential times, or a set of raw data points comprising the signals from a single detector at sequential times. The meaning will generally be clear from the context.

SUMMARY OF INVENTION

According to a first aspect of the present invention, there is provided a method of monitoring a length of single-mode optical fiber (104) for physical disturbance along its length, comprising the steps of:

(i) at a first location (100), launching into the fiber polarized light having at least two different predetermined launch states of polarization (La,Lb) whose respective Stokes vectors (s_(La), s_(Lb)) are linearly independent of each other;

(ii) at a second location (102) downstream from the first location, receiving the light from the fiber;

(iii) analyzing the received light using at least two different analyzer states of polarization that are characterized by respective Stokes vectors that are linearly-independent of each other:

(iv) detecting (126, 128, 130, 132) the analyzed light to provide corresponding electrical detection signals (d1, d2, d3, d4) and deriving therefrom corresponding digital detection signals (D1, D2, D3, D4);

(v) deriving from the digital detection signals (D1, D2, D3, D4) one or more representations ([s_(a)(t₁), s_(b)(t₁)], [s_(a)(t₂), s_(b)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) of a change in the polarization coupling properties (R) of the fiber between a first time (t₁) and a second time (t₂); (vi) computing from said one or more representations at least one measure (4.14/4.15) of change in polarization coupling properties of the fiber, said at least measure being substantially independent of said launch states and said analyzer states; and (vii) determining with reference to preselected criteria of reportable physical disturbances whether or not said at least one measure indicates a reportable physical disturbance along said length of fiber.

According to a second aspect of the invention, there is provided a monitoring system for monitoring a length of single-mode optical fiber (104) for physical disturbance at a location or zone (L) along its length, comprises a launch unit (100) and a monitoring unit (102):

the launch unit (100) being adapted for launching into one end of the length of fiber polarized light at a first location light having at least two different predetermined launch states of polarization (La,Lb) whose respective Stokes vectors (s_(La),s_(Lb)) are linearly independent of each other;

the monitoring unit (102) being adapted for receiving the light from a second end of the fiber and comprising:

analyzing means for analyzing the received light using at least two different analyzer states of polarization that are characterized by respective Stokes vectors that are linearly-independent of each other:

means (126, 128, 130, 132) for detecting the analyzed light and providing corresponding electrical detection signals (d1, d2, d3, d4);

analog-digital conversion means (142) for deriving from the electrical detection signals corresponding digital detection signals (D1, D2, D3, D4); and

processing means operable for:

(i) deriving from the digital detection signals (D1, D2, D3, D4) one or more representations ([s_(a)(t₁), s_(a)(t₁)], [s_(a)(t₂), s_(a)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) of a change in the polarization coupling properties (R) of the fiber between a first time (t₁) and a second time (t₂);

(ii) computing from said one or more representations at least one measure (4.14/4.15) of change in polarization coupling properties of the fiber, said at least one measure being substantially independent of said launch states and said analyzer states; and

(iii) determining with reference to preselected criteria of reportable physical disturbances whether or not said measure or measures indicate a reportable physical disturbance.

Preferably, in embodiments of either of the first and second aspects of the invention, said one or more representations comprise a plurality of said representations ([s_(a)(t₁), s_(a)(t₁)],[s_(a)(t₂), s_(a)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) over a predetermined period of time, each successive representation being stored and with preceding representations to provide change information comprising a sequence ([s_(a)(t₁), s_(a)(t₁)], [s_(a)(t₂), s_(a)(t₂)], [s_(a)(t₃), s_(b)(t₃)], . . . ; M(t₁,t₂), M(t₂,t₃), . . . ; c(t₁,t₂), c(t₂,t₃), . . . ; C(t₀), C(t₁), C(t₂), . . . ) of change information (4.23) for the fiber polarization coupling (R) over said predetermined period of time.

The criteria may take a variety of forms depending upon the particular application of the system. The criteria may be obtained following installation by accumulating data over a period of time, free of disturbances, to establish a baseline for future reference when setting such criteria as change-space distance thresholds, frequency thresholds, number of times thresholds are exceeded in a specified time interval, and so on.

The measure(s) and associated criteria may be based upon the time domain and/or the frequency domain and/or geometric or dimensional properties of changes and/or other methods of analysis. A time-domain determination may be based on a measure of the magnitude of the change in the fiber over a prescribed time period. Either a time domain or a frequency domain determination may be based upon a measure of apparent movement or oscillation (of the fiber) in a manner that changes its polarization coupling properties in a one, two or three parameter manner over a prescribed time period, and this may provide information about the geometric complexity of a disturbance. The frequency-domain determination may be based upon using Fourier analysis to convert the change information over a prescribed time period into the frequency domain and obtaining a power spectrum characterizing the apparent disturbance, then computing measures of the power distribution in different frequency bands and comparing with values for these measures that are known to characterize reportable and/or unreportable disturbances.

Other aspects of this invention include the launch unit per se, launch method per se, the monitoring unit per se and the monitoring method per se, for use in the method or system of the first and second aspects.

More-specific features of the monitoring system and method may be as set out in the claims attached hereto and/or correspond to the foregoing method steps.

The foregoing and other objects, features, aspects and advantages of the present invention will become more apparent from the following detailed description, taken in conjunction with the accompanying drawings, of preferred embodiments of the invention, which are given by way of example only.

BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings, identical or corresponding elements in the different Figures have similar reference numerals, in some cases with a prime or double prime denoting a difference.

FIG. 1 is a simplified block schematic diagram of an optical fiber monitoring system embodying the invention and comprising a launch unit and a monitoring unit connected to respective ends of a length of optical fiber to be monitored;

FIGS. 2A and 2B illustrate how the length of optical fiber transforms two different launch states of polarization at different times;

FIG. 3A illustrates similar transformations to those shown in FIGS. 2A and 2B but using the well-known Poincaré Sphere;

FIG. 3B illustrates a Poincaré Sphere showing of the output polarization states relative to analyzer states;

FIG. 4 illustrates signal processing and data flow in the processing unit of FIG. 1;

FIG. 5 illustrates the relationship between the launch polarization states and a corresponding detection signal over several time intervals, and depicts vectors and matrices used to derive from the detection signal a representation of fiber transformation changes;

FIG. 6 illustrates timing and state diagrams for signals in the launch unit and the monitoring unit of FIG. 1;

FIG. 7 illustrates correction of phase offset of detection signals for registration purposes;

FIG. 8 is a high level flowchart of the phase offset correction and registration process followed by reduction of data points to one per half cycle;

FIG. 9 is a flowchart showing in more detail phase correction steps of the flowchart of FIG. 8;

FIG. 10 is a flowchart depicting computation of change information (sequence of change-space position vectors) representing successive positions of the fiber transformation state in a notional three-dimensional space;

FIG. 11 is a flowchart depicting computation of distance moved in change space over time;

FIGS. 12A, 12B and 12C are graphs which depict the position in change space over time for each of three change space axes and FIG. 12D is a graph depicting the distance moved in change space as in FIG. 11;

FIG. 13 is a flowchart depicting computation of power spectra for three dimensions x, y and z, respectively, from the Change Information of FIG. 10, combining them to form an isotropic power spectrum, and evaluation to determine whether or not a reportable disturbance occurred;

FIGS. 14A, 14B and 14C illustrate portions of the three power spectra for the three dimensions x, y and z, respectively, and for a short time interval, while FIG. 14D shows the isotropic power spectrum corresponding to the sum of the spectra of FIGS. 14A, 14B and 14C, respectively;

FIG. 15 is a flowchart depicting decision and reporting processing for determining whether or not to report a disturbance;

FIG. 16 illustrates schematically a second specific embodiment of the invention in which the launch unit uses two launch states and the detection unit uses two analyzer states and an unpolarized detector;

FIG. 17 illustrates schematically a third specific embodiment of the invention in which the launch unit uses two launch states, the detection unit uses two analyzer states and a clock signal from the launch unit;

FIG. 18 illustrates schematically a fourth specific embodiment of the invention in which the launch unit uses two launch states and the detection unit uses three analyzer states;

FIG. 19 illustrates schematically a fifth specific embodiment of the invention in which the launch unit uses three launch states and the detection unit uses two analyzer states; and

FIG. 20 is a block diagram of a polarization controller of the launch unit of FIG. 19.

DESCRIPTION OF SPECIFIC EMBODIMENTS

Intrusion (disturbance) detection by embodiments of the present invention is predicated upon the fact that bending, twisting or compressing an optical fiber changes the polarization transformation properties of the fiber and so may result in concomitant changes in the state of polarization of light propagating along it. Monitoring systems embodying this invention, one specific example of which is illustrated in FIG. 1, comprise a launch unit 100 and a monitoring unit 102 interconnected by a length of optical fiber 104 to be monitored. The monitoring unit 102 comprises a detection unit 106 and a processing unit 108. The launch unit launches into the length of optical fiber 104 light that has “launch states” of polarization that are significantly polarized and characterized by Stokes vectors that are linearly-independent of each other. After the light has traversed the length of fiber, and been received by the monitoring unit, the detection unit analyzes its polarization using analyzers having at least two different “analyzer states of polarization” that are significantly polarized and are characterized by Stokes vectors that are linearly-independent of each other.

In contrast to the previously-known fiber movement/disturbance monitoring systems discussed hereinbefore, in monitoring systems embodying the present invention the detection signals are processed to create at least one measure that is substantially independent of the actual polarization states used to “probe” the fiber, specifically the launch states and analyzer states, and hence characteristic of the fiber polarization transformation itself and how it changes over time.

In embodiments of this invention, the processing unit compares the measure(s) with criteria predetermined as indicative of a reportable disturbance and, on the basis of the comparison, may output an alarm or other signal via an alarm/output unit 110. The measures and corresponding predetermined criteria may be distance-time-related or power-frequency-related (i.e., power spectrum), or a combination of both, as will be described more fully later.

The way in which polarization transformation properties of the length of fiber change the polarization of the light propagating along it, over time, is illustrated in FIGS. 2A and 2B, for two launch states of polarization. Thus, in FIG. 2A, two “launch states” of the light launched into a fiber at a first time t₁ are represented as “L_(a)” and “L_(b)”, respectively. These diagrams indicate graphically the direction of oscillation of the electric field, showing as examples horizontal linearly polarized light for launch state La and 45-degree linearly polarized light for state Lb. The polarization transformation properties of the fiber, extant at that time t₁, change the polarization states of the light as it propagates along the fiber resulting in different output polarization states formed at the far end of the fiber 104, as represented by a(t₁) and b(t₁). In this example, the linear polarization state L_(a) has been transformed into a circular polarization state and the 45-degree linear polarization state Lb into a vertical linearly polarized state.

At the later time t₂, depicted in FIG. 2B, the launch states L_(a) and L_(b) are the same, but the fiber transformation is different because of some movement of the fiber 104 relative to its condition at time t₁. Consequently, the transformed output state a(t₂) has changed a little to be somewhat elliptical, and the transformed output state b(t₂) has remained linear but has rotated about the axis of the fiber.

This transforming of the launch states differently at the different times t₁ and t₂ is illustrated symbolically in FIG. 3A using Poincaré spheres X0, X1 and X2, and matrices representing the polarization transformation properties of the fiber at times t₁ and t₂ are indicated by symbols R(t₁) and R(t₂), respectively. The launch states are represented by their associated Stokes vectors s_(La) and s_(Lb), and these vectors are shown graphically on the first Poincaré sphere X0. Second Poincaré sphere X1 shows the Stokes vectors s_(a)(t₁) and s_(b)(t₁) representing the output states of polarization of the light as received at time t₁, after transformation by R(t₁) and third Poincaré sphere X2 shows the Stokes vectors s_(a)(t₂) and s_(b)(t₂) representing the output states of polarization of the light as received at time t₂ after transformation by R(t₂). The relationships are simple matrix multiplication of a column vector, vis. s_(a)(t₁)=R(t₁) s_(La) at time t₁ and s_(a)(t₂)=R(t₂) s_(La) at time t₂.

A comparison of Poincaré sphere X1 with Poincaré sphere X2 shows that changes in the fiber's polarization transformation properties which occurred between times t₁ and t₂ caused the Stokes vectors of the two output states to move on the Poincaré sphere. The change in the fiber's polarization transformation properties, between the two times t₁ and t₂, i.e., between spheres X1 and X2 in FIG. 3A, is expressed as a change matrix M(t₁,t₂), where M(t₁,t₂)=R(t₂)R⁻¹(t₁). This way of constructing it simply transforms the t₁ output states backwards to the launch states and then forwards to the t₂ output states, and is equivalent to R(t₂)=M(t₁,t₂) R(t₁). Although this relationship is a good way to define and explain the change matrix, in practice the processing unit will not actually compute it this way from the data because it will not actually determine the R matrix that describes the fiber's polarization transformation.

FIG. 3B shows not only the output states but also two squares representing analyzer states used to analyze the polarization of the received light. It should be noted that the analyzer states are stable relative to the output end of the fiber, as are the launch states relative to the input end, but the relationship between the launch states and analyzer states need not be known.

Change matrix M(t₁,t₂) is a rotation matrix that describes the rotation of a sphere between times t₁ and t₂, and the rotation of a sphere is defined uniquely by the motion of two points on its surface. (See Rotations, Quaternions and Double Groups by Simon Altman, Dover, 2005, pages 22-23.) Therefore, it is sufficient to determine the Stokes vectors s_(a) and s_(b) for the two output states at each of the two different times t₁ and t₂ in order to determine the change matrix M(t₁,t₂) from measurements of the detection signals; with no reference to the launch states per se or the matrix R for the fiber's polarization transformation. Preferred embodiments of the present invention, in effect, accumulate determine a series of such change matrices over a predetermined period of time and use them to derive one or more measures dependent upon movement of the fiber during that time period. More particularly, they convert a sequence of measured change matrices into “Change-Space Position Information” conveniently represented as a sequence of vectors representing successive positions in a notional three-dimensional space called “change space”; and they subsequently reduce the change-space information to one or more practical measures or metrics suitable for evaluation to determine whether or not a reportable disturbance has occurred. One measure may be the distance or length of displacement in change space over a particular time (approximately equivalent to an angle of rotation on the Poincaré sphere); another might be a power spectrum (function of frequency) that can be reduced further by integration over specific regions of frequency to create simpler measures that are useful for reporting. A category of useful reduced measures relates to the dimensional and geometric properties of the change in the fiber (e.g. one-, two- or three-dimensional parameter changes) or of the disturbance that causes it.

As a result of their substantial independence of the launch and analyzer states, these measures are substantially independent of changes in static twisting, bending or stressing of sections of the fiber that are removed from the zone of the disturbance. In other words, they are substantially free of fading.

The signal processing involved in deriving the measure(s) from the output states, and then evaluating it/them to make a decision, is illustrated in FIG. 4 in terms of process steps and data flow and depicted schematically and mathematically in FIG. 5 in terms of computation of vectors and matrices for several cycles of the polarization modulation.

FIG. 5 shows three cycles of one of the detection signals identified by their medial times t₁, t₂ and t₃, the various operations shown in box 5.01 of FIG. 5 represent mathematically the processing performed in process 4.13 of FIG. 4 to derive the change information from the detection signal as a sequence of change space vectors.

Thus, in FIG. 4, the major signal-processing steps 4.01 that are performed by the processor 144 to derive the change-space position information represented by vectors C(t₁), C(t₂) and C(t₃) in FIG. 5 are shown as Data Acquisition process (4.11), Pre-processing process (4.12) and Computation of Fiber Polarization Transformation Change Information process (4.13), the corresponding data sets 4.02 that are inputs and outputs for these processes being “raw” Data (4.21), Averaged Detection Signals (4.22) and Change Information (4.23), with arrows indicating the flow of data into and out of the processes.

FIG. 5 does not necessarily represent the grouping of operations into subroutines or loops. It shows both the names and the symbols for the mathematical objects that are used as intermediates in this particular description of embodiments of the invention, for operating on detection signals to compute the measure(s) of fiber transformation changes. The processor uses the Change Information 4.23 in Measure-1 computation process 4.14 and Measure-2 computation process 4.15 to compute these two measures and accumulates them over time to produce Measure-1 History data 4.24 and Measure-2 History data 4.25 which it uses in Decisions and Reporting process 4.16.

The Decisions and Reporting process 4.16 is shown as a single process but it may be used with the Measure-1 History and Measure-2 History independently to make a separate decision for each, though the decisions need not be reported separately.

Except where noted below, in the specific embodiments described herein, the signal processor 144 carries out the data processing steps independently and asynchronously (e.g. in different threads); and the output data sequence created by each process step is stored in a corresponding circular buffer with enough capacity to ensure that subsequent process steps that use the data have access to what they need. (Such buffering is known to those skilled in the art and will not be discussed in detail here.)

In data acquisition process 4.11, the processor 144 acquires “raw” digital data from the A-to-D converter unit 142 for the four detection signals D1-D4 (FIG. 1) and stores it in a buffer as Raw Data 4.21. (Data acquisition is a process generally known to those skilled in the art so process 4.11 will not be described in detail here.)

In process 4.12, the processor acquires a block of “raw” data 4.21 from the detection signals, and first performs phase offset measurement and correction or registration, as will be described in more detail later with reference to FIG. 8. (An exception being “synchronized” embodiments in which the launch unit supplies a synchronizing signal to the monitoring unit.) This phase correction and registration process ensures that each detection signal will be correctly sampled and that average values for the signals will be derived accurately from the samples in the relatively unchanging medial portion 5.13 (shown bold in FIG. 5) of each half-cycle of the detection signal, illustrated as trace 5.14 in FIG. 5.

The correspondence between the alternating detection signal and the alternating launch states of polarization (see FIGS. 2A, 2B and 3A) is indicated in FIG. 5 by showing above each half-cycle of the detection signal the respective one of the Stokes vectors s_(z), and s_(Lb). (as at 5.12). As will be described in more detail later with reference to FIGS. 8 and 9, the processor averages the sample values represented by the highlighted section 5.13 to reduce noise effects and create a single sample value for each half-cycle. These sample values form the averaged detection signals data 4.22 used by processor in process 4.13 to compute the change information data 4.23.

The sequence of averaged detection signals 4.22 stored following process 4.12 contains sufficient information on successive changes in the fiber transformation that process 4.13 (see also 5.01) can “purify” it from dependence on the launch states and analyzer states of polarization. The first step is to reduce it to a sequence of pairs of output Stokes vectors independent of the analyzer states of polarization. After that, the information will be further reduced to a sequence of “change matrices” that are also independent of the launch states of polarization. These will be converted to another form of change information sequence 4.23, and subsequent measures of the fiber disturbance will be computed from this change information sequence.

Thus, referring also to FIG. 5 and using the cycle at time t₃ as an example, the two averaged detection signal values from the cycle t₃, one in each half cycle, are used to determine the pair of Stokes vectors s_(a)(t₃) and S_(b)(t₃) (5.15). This pair of Stokes vectors s_(a)(t₃) and S_(b)(t₃) is combined with the pair s_(a)(t₂) and S_(b)(t₂) from the preceding cycle t₂ to compute the 3×3 change matrix M(t₂,t₃) (5.17) representing the change in the fiber polarization transformation between the two cycles. This 3×3 change matrix M(t₂,t₃) is converted (5.18) directly into a 3-element “step” vector representation c(t₂,t₃). Step vector c(t₂,t₃) is added to the accumulated sum C(t₂) of the previous step vector(s) to form sum vector C(t₃), representing the current position in notional “change space”.

The pairs of output Stokes vectors s_(a)(t) and S_(b)(t) are substantially independent of the analyzer states used to analyze the polarization of the detection signal, except in the sense that the numerical values of their components depend on the basis used to represent the vectors, and the basis will usually be chosen in relation to properties of some of the analyzer states. But they are not independent of the launch states of polarization L_(a) and L_(b). However, when the two pairs of Stokes vectors for the two polarization cycles are reduced to a “change matrix” (5.17) that represents the change in the fiber polarization transformation, the change matrix is not only substantially independent of the analyzer states (except for basis) but also substantially independent of the launch states of polarization (with no basis relationship). Hence, the change matrices M(t₁,t₂), M(t₂,t₃) and so on, the step vectors c(t₁,t₂), c(t₂,t₃) and so on, and the change-space position vectors C(t₁), C(t₂), C(t₃), and so on, are all dependent, in effect, only on changes in the polarization transformation properties of the fiber over the time concerned. Any property or measure that is computed from any one of them and that is basis-independent (or rotation and reflection independent) will thus be substantially independent of both the launch and detection states, without even a basis dependence. Such measures include, for example, the length of a vector, the angle between two vectors, or the angle of rotation indicated by a rotation matrix.

It should be noted that, when each new change-space position vector is computed, its predecessor is not discarded, i.e., C(t₁), C(t₂), C(t₃) and so on are saved in a buffer. Hence, a sequence of change-space position vectors is saved, representing a path through change space. The change-space position vector C(t₃) represents the change-space position at time t₃ (5.11). The cumulative change-space position data represented by sum vectors C(t₁), C(t₂), C(t₃), and so on, stored in the buffer as Change Information 4.23, will be used by the processor in both of steps 4.14 and 4.15 to compute Measure-1 and Measure-2, respectively, for each evaluation by Decisions and Reporting process 4.16 to determine whether or not a reportable disturbance has occurred.

4.14 Compute Measure-1; Change-Space Distance

The change information 4.23 will be accumulated on a continuous basis, earlier change information being discarded, as appropriate, by the circular buffer. To compute Measure-1, in computation step 4.14, the processor 144 takes current change information 4.23 and previous change information that was computed a time period t_(r), earlier, usually many polarization modulation cycles earlier, and reduces the current and previous change information to a measure of the distance moved in notional change space over time period t_(D). This movement is approximately equivalent to the corresponding rotation (in radians) of the fiber transformation on the Poincaré sphere over that time period t_(D).

After a short delay that is a fraction of the time period t_(D), the processor 144 obtains two more samples of the change information, preferably, but not necessarily separated by the same time period t_(D), and computes from them another distance moved. The processor repeats this process, storing a sequence or history of this distance moved over a predetermined period of time, as the Measure-1 History 4.24. Periodically, the processor evaluates this Measure-1 History data with respect to preset criteria deemed characteristic of an intrusion attempt in order to decide whether a reportable disturbance has occurred (see process 4.16).

4.15 Computation of Measure-2; Power Spectra

To compute Measure-2, in computation process 4.15, the processor 144 takes a series of blocks of recent Change Information data 4.23, each block equivalent to a relatively large number of samples of the detection signal 5,14 (FIG. 5), and uses Fourier analysis to reduce each block to an “isotropic power spectrum array” (as explained in more detail hereinafter with reference to FIG. 13) that characterizes the changes in the fiber polarization transformation over the duration of the block.

The processor may then weight each of the isotropic power spectrum arrays to obtain a series of weighted integrated power values and store these values sequentially as the Measure-2 History 4.25. The processor evaluates this history 4.25 in Decisions and Reporting process 4.16 with reference to preselected spectral power parameters, e.g. thresholds or other spectral attributes, deemed characteristic of an intrusion in order to decide whether a reportable disturbance has occurred.

4.16 Decisions and Reporting

In decisions and reporting process 4.16, the processor 144 evaluates the recent histories 4.24 and 4.25 of measures 1 and 2, respectively, against criteria (for example thresholds or parameters) that have been preset, either by the manufacturer, the installer, or the user, according to the application for the system, to determine whether a reportable disturbance has occurred. In this specific exemplary embodiment, the process 4.16 makes the decisions for the two different measures independently with no regard for correlations between the two. In fact, the decision as to whether or not to signal a reportable disturbance condition for each measure might be considered as being part of the corresponding process 4.14 or 4.15 that computes that measure.

Hence, the processor will produce two independent decisions, one based upon a comparison of Measure-1 with preselected distance-related characteristics indicative of an intrusion, and the other based upon a comparison of Measure-2 with preselected power spectrum-related characteristics indicative of an intrusion. Other similar embodiments might use one or the other of Measure-1 and Measure-2 alone, or use more complex decisions and reporting processes, including correlations between the two different measures. For example, if a particular situation is likely to involve disturbances characterized by relatively slow changes, it might be enough for a system to be programmed to accumulate and evaluate only Measure-1 data.

It will be appreciated that, as is common for security systems and other similar applications, the actual values used for thresholds, time intervals, and so on will vary widely with the application, the particular installation, and the preferences of the user; and the processor may permit the user to set the parameters to whatever values, and by whatever procedure, that he wishes to use.

To this end, the processor 144 will have means (not explicitly shown in the diagrams) by which the user can communicate with it in order to set the values of parameters that it uses for processing. Sets of thresholds characterizing reportable disturbance characteristics might be preset by the manufacturer based, for example, upon empirical data, and a particular set suitable for a particular situation selected by the user. The preset thresholds etc. might be updated by the user, or automatically by the processing unit, based upon actual usage over a period of time with and without real reportable disturbances occurring. The manner in which these thresholds are determined and used are known to those skilled in this art and need not be described in more detail here.

The particular ways of producing and representing the fiber transformation change information in the foregoing description are chosen for their convenient intuitive and physical meanings in illustrating the principles of the invention. Other ways of deriving change information may be employed, using different hardware and/or software configurations, for example, so long as the derived change information 4.23 will be substantially independent of launch states and analyzer states while remaining dependent upon changes in the fiber polarization coupling properties.

Several different embodiments will now be described, by way of example only, to show how the above-described process can be carried out with various combinations of launch states and analyzer states. The different embodiments will generally require different computations for determining the Stokes vectors of the output states and the changes in the fiber polarization transformation, and may have different tradeoffs between cost and sensitivity or noise. Their operations will be described up to the point of determining the Stokes vectors that characterize the output states, i.e., the Change Information 4.23. In general, the Change of Information will be similar in form for all of these embodiments. Thereafter, the computations of Measure-1 and Measure-2 and their Histories, and the Decisions and Reporting processes, may continue as described hereinbefore with reference to Steps 4.14, 4.15 and 4.16 of FIG. 4, subject to modifications to be described afterwards.

In order for the calibration matrix W to exist and work properly, it is necessary for the four Stokes 4-vectors that characterize the four analyzer states to be linearly-independent. This can be proven by noting that the equation P=W⁻¹ S defines the four detector powers as a linear sum of the Stokes 4-vector components, and the matrix designated as W⁻¹ necessarily exists (even if it is not invertible and its inverse, W, does not exist). Because the P components are just powers or the S₀ terms of the Stokes 4-vectors for light passing through an analyzer, each row of W⁻¹ is simply the top row of the Mueller matrix that defines the polarization transformation done by the corresponding analyzer (134, 136, 138,140). But these rows are identical to the Stokes 4-vectors that describe the analyzers. If they are not linearly independent, the matrix W⁻¹ is singular and has no inverse. But if they are linearly independent, then the inverse W exists, i.e. there is a calibration matrix.

A system with four analyzer states characterized by Stokes 4-vectors that are linearly independent will automatically meet the simpler requirement that there should be at least two analyzer states that are polarized to a significant degree and are characterized by Stokes 3-vectors that are linearly-independent. It should be noted that a Stokes 3-vector can characterize an analyzer state so long as it is a polarized state, even though the Stokes 3-vector does not include the extinction-ratio information that a Stokes 4-vector characterization would include. One of the four analyzer states can be completely unpolarized (thus having no associated 3-vector), but not two.

One method for constructing the calibration matrix W is implied by the proof of its existence that appears above. The matrix W⁻¹ is simply constructed so that each row is the top row of the Mueller matrix that defines the polarization transformation done by the corresponding analyzer, or so that each row is the Stokes 4-vector that characterizes the corresponding analyzer state. Then its inverse is computed to get W. Alternatively, the matrix W can be calibrated by methods that are described by the method of Mikhailov et. al, described in “Robust Remote Calibration of Fiber Polarimeters,” in Optical Fiber Communication Conference, OSA Technical Digest (CD) (Optical Society of America, 2011), paper OWC4.

In order to achieve a good signal-to-noise ratio, it is valuable for the four Stokes 4-vectors representing the analyzer states to be, in some sense, “robustly” linearly independent. If the 4-vectors should be too close to a condition of linear dependence, then, even though the matrix W⁻¹ is well-behaved, its inverse W might be nearly singular, in a way that causes the four weights (one row of W) used for computing at least one of the Stokes vector components to include large positive and negative numbers. The weighted sum could then include a small difference between large multiples of two or more detection signals, and would be very sensitive to small noise-induced changes in the measured detection-signal values. The Stokes 4-vectors for the analyzer states should therefore be far enough from a condition of linear dependence to support a signal-to-noise ratio that is sufficient for the particular application.

Similar requirements apply to the linear independence requirement on the Stokes 3-vectors characterizing the two launch states of polarization. These two states should also be sufficiently polarized to support a good signal-to-noise ratio. These conditions are not difficult to achieve, and the range of states that may be used for launch states and analyzer states remains broad. For example, a commercial product suitable for use in this application, the General Photonics PolaDetect™ comprises an assembly with four detectors preceded by polarization filters, having analyzer states that can be described approximately as x-polarized, y-polarized, 45-degree polarized and circularly polarized. The states are not perfect in these polarizations or in their relative alignment, and their extinction is not perfect, but this is immaterial. The product is provided with a calibration matrix.

As will be described, the four detectors of the first embodiment allow an output Stokes vector to be determined from a simple weighted sum of the four detection signals, provided the four analyzer states meet certain simple criteria. Other embodiments illustrate how to implement the invention with fewer analyzer states, generally meaning fewer components and lower cost. In these cases, the Stokes vectors are not expressed as simple weighted sums, but there is still enough information to determine the change matrix M in the sense that, being a rotation matrix, M only has three independent parameters (such as the well-known Euler angles), while a even a two-detector system measures four signals (more than the three unknowns) during a cycle of polarization modulation. Formulae for data reduction with fewer detectors may involve square roots that are double-valued functions, however, and they may have singular conditions where they are susceptible to noise. These conditions can be handled by continuity requirements on the Stokes-vector motions, recognition and mitigation methods for noise sensitive conditions, and use of additional information, such as stability of the scalar product or angle between the two Stokes vectors, or constancy of the optical power.

Embodiment 1 Two Launch States; Four Analyzer States

A first embodiment, a monitoring system that uses two launch states and four analyzer states, will be described by way of example with reference to FIGS. 1 to 15. As shown in FIG. 1, a launch unit 100 and a monitoring unit 102 are coupled to first and second ends, respectively, of a length of optical fiber 104 which might be from a few meters to more than 40 km long. In the following description, the first end and second end may also be described as proximal and distal, respectively, with respect to the launch unit 100. The first/proximal end will be deemed to be “upstream” and the second/distal end “downstream” even though there may be counter-propagating reflected light and regardless of whether the two ends are physically remote from each other, for example as ends of an optical cable in an optical communications network, or physically co-located, for example where the length of optical fiber comprises two individual fibers “looped back” in the same cable or where the fiber is deployed in a loop around a zone being protected against intrusion, such as an electric power substation. It should also be appreciated that the “ends” may be ends of a portion of a longer fiber, either or both of the launch unit 100 and the monitoring unit 102 being connected to it by, for example, a fiber tap or taps.

The launch unit 100 comprises a linearly-polarized laser source 112, a polarization controller 114 and a timing control unit 116. The light from the laser source 112 is passed through the polarization controller 114 which, under the control of a 1 kHz timing or clock signal CK_(P) (see trace (A) of FIG. 6) from timing control unit 116, modulates the polarization of the light to produce two different polarization states (“launch states”) which alternate with a 50% duty cycle, each cycle having a 1 millisecond period, as shown in trace (B) of FIG. 6. The polarization controller 114 comprises a device which accepts linearly-polarized input light and switches its output between two linearly-polarized states at 45 degrees to each other (which is a 90 degree rotation in a Poincaré-sphere or Stokes-vector representation of the state). A suitable such device is marketed by General Photonics Corporation under the trademark PolaSwitch™.

At the monitoring unit 102, the light that was launched into the proximal end of fiber 104 and has propagated along the length of fiber 104 (past a disturbance location or zone) is received at the distal end by a detection unit 106 and converted to corresponding detection signals which are processed by the processing unit 108 to identify changes consistent with a physical disturbance of the fiber 104 indicative of an intrusion attempt somewhere along its length. Such an intrusion attempt might involve movement of the fiber 104 by someone preparing to tap it and extract data signals propagating therealong, e.g., where the fiber is used for communications purposes, or by an intruder attempting to access a secure area around which the fiber 104 is deployed, perhaps along a perimeter fence or buried in the ground and defining the perimeter. In the case of a data communications application, the fiber 104 might be carrying optical data signals coupled into the fiber by means not shown in FIG. 1, for example at a different wavelength using wavelength-division multiplexing; or the fiber 104 might be a dedicated sensing fiber that is in the same cable as another fiber that is carrying optical data signals.

The detection unit 106 comprises an input lens 118 which receives the light from the distal (downstream) end of the fiber 104 and directs it to each of three beam splitters 120, 122 and 124. The beam splitters 120, 122 and 124 each transmit a portion of the incoming light and reflect a portion to distribute portions of the light to four detectors 126, 128, 130 and 132, respectively, via four polarizers 134, 136, 138 and 140, respectively. (It is noted that the analyzer states of polarization depend not only on the polarizers, but also on the effects of the associated beamsplitters.) The arrangement is such that the incoming light is split and applied to the four polarizers 126, 128, 130 and 132 in approximately equal amounts. Each of the detectors 126, 128, 130 and 132 produces an electrical detection signal whose magnitude is proportional to the optical power that it receives. It alternates between two levels as the launch polarization state is modulated, and these levels may change as the fiber is disturbed. Such an alternating signal is shown as trace C of FIG. 6, one cycle being shown expanded in trace D. Each of the detectors may include an associated amplifier such as a transimpedance amplifier (not explicitly shown), as is common.

The analog electrical detection signals d1, d2, d3 and d4 from detectors 126, 128, 130 and 132, respectively, are supplied to an analog-to-digital converter (ADC) unit 142 in the processing unit 108, where four respective ADCs (not shown) clocked by a local clock signal CK_(D) (trace D in FIG. 6) from local clock source 146 convert them to corresponding digital detection signals D1, D2, D3 and D4 that are then supplied to a processor 144 in processing unit 108.

The frequency of local clock signal CK_(D) is nominally an integer multiple of the frequency of the polarization clock signal CK_(P) (trace A) generated in the launch unit 100. In this specific embodiment, the frequency of the local clock CK_(D) is 50 kHz. Consequently, the digital detection signal for each detector comprises 50 samples in each cycle of the polarization modulation, i.e., 25 samples for each half-cycle, as shown in expanded trace D of FIG. 6.

The operations carried out by the processing unit 106 upon the digital detection signals, and the data sets processed, described in general terms with reference to FIG. 4, will now be described in more detail with reference, as necessary, to other Figures.

4.11 Data Acquisition

Data will be preprocessed in blocks containing an integer number of segments corresponding to respective cycles of the polarization modulation and hence the detection signal when properly registered with the segments. In the embodiment of FIG. 1, each one-cycle segment comprises 50 raw-data points i.e. 50 samples from each detector, and the integer number is 100, so a block contains, for each detection signal, 5,000 points representing 0.1 second of data, i.e., 100 cycles of the polarization modulation. (This is in contrast to the 10 points per cycle used for ease of illustration in FIG. 7 and elsewhere.) There are four detection signals so, in process 4.11, the processing unit 108 acquires 200 samples per cycle, i.e., 20,000 samples per 0.1 second block and stores them as “raw” data 4.21. Data acquisition techniques are generally known to those skilled in this art and so will not be described in more detail here.

4.12 Pre-Processing

Referring also to FIGS. 8 and 9, the pre-processing 4.12 comprises (i) selecting a block of detection signal samples and determining phase offset correction for registration purposes (steps 8.01 and 8.02); and (ii) averaging and reducing selected samples to one sample per half-cycle of each detection signal (Step 8.03) and storing the averaged values (Step 8.04).

4.12(i) Phase Offset Correction and Registration

As mentioned above, the frequency of the clock CK_(D) (trace D) is chosen to be an integer multiple of the polarization modulation frequency of clock CK_(P) (trace A), but such integer multiple will not generally be exact due to inaccuracies in the clocking components, and the phase of each of the analog detection signals d1-d4 will generally drift relative to the ADC clock signal CK_(D) (trace D). To process the “raw” (un-averaged) digital detection signals D1-D4 properly, it is necessary to correct for “phase offset” and ensure that each set of samples selected for averaging (reduction) does not overlap the transitions, and extends only over a relatively unchanging part of the modulated waveform. The aim is to ensure that the right points are averaged to create the averaged detection signal (4.22) for each half cycle of the data. Equivalently, the aim is to refine the selection of the block so that it is properly aligned or in proper registration with the fixed “averaging pattern”.

The “averaging pattern” is a rule that determines which points will be averaged to make the average detection signals for each half cycle of the polarization modulation. This pattern and its registration are illustrated by the example of FIG. 7. For ease of illustration and simplification of description, FIG. 7 shows only 10 raw data points per cycle of the signal from each detector, instead of 50.

Each of the plots (a)-(e) shows the waveform of the detection signal as the launch polarization state is modulated. Signal samples marked with x are intended to be used to create an average value for each half cycle, and those marked with circles are intended to be ignored. The pattern of time positions for the x marks and O marks is the averaging pattern. In different traces, the averaging pattern position is slightly different as a result of phase drift between the ADC clock signal C_(D) and the polarization modulation waveform. The averaging pattern of trace (d) has poor registration or alignment because for each cycle, two of the points selected for averaging (marked with x) are on transitions, and two points selected for rejection (marked with O) are in useful stable parts of the data. Trace (e) has the same measured data points but shows that the processor 144 has shifted the averaging pattern by a single raw data point so that it is in proper registration. The earlier traces (a) through (c) in FIG. 7 show slightly different positions for the digitized raw data points because of a phase drift of the sampling times relative to the polarization-modulation waveform, by 0%, 30% and 60% respectively of the time interval between raw points. However, a single registration of the averaging pattern is acceptable for all of these, and that registration only becomes poor when the phase has drifted from its location in trace (a) by nearly 80% of the raw point interval, as in trace (d).

When the system is being built, the ratio of the frequency of the digitizing clock CK_(D) (from sampling clock 146) to the frequency of the polarization modulation clock CK_(P) from timing control 116 is made close enough to its nominal integer value that, if the drift is sufficiently slow, a single registration of a long periodic averaging pattern will generally suffice for an entire block. In other words, if registration is done well in the middle of the block, then data in the two one-millisecond segments at the beginning and the end of the block will also be sufficiently well aligned with the averaging pattern that transitional effects do not significantly influence the results.

Requirements placed on the four analyzer states of polarization (discussed later) will generally guarantee that at least one of the four time-dependent detection signals d1, d2, d3, d4 has a substantially square waveform with significant amplitude, which can be used for determining the proper registration. Consequently, the processor selects only one of the four detection signals (having the largest amplitude) for use in determining phase offset and registration.

Referring again to FIG. 8, in Step 8.01, the processor 144 selects a contiguous block of raw data 4.21 to be reduced to averaged detection signals 4.22. (This selection is preliminary, and before reduction it may be altered by the phase offset correction, shifting the block's boundaries a small amount to align the data with the averaging pattern. (In this example the averaging pattern is fixed relative to the block boundaries, so shifting the boundaries is the same as shifting the averaging pattern.) When the system begins operation, this step 8.01 selects the first 5,000 raw data points (each “point” here meaning a set of four signal samples at a particular time, one from each of the four different detectors), waiting for the data acquisition process to put enough raw data 4.21 in its buffer if necessary. At later times, each new block begins immediately after the last point of the last block of raw data that was reduced. This rule automatically starts the new block with the same averaging-pattern registration that was used for reducing the previous block, although that may be changed by shifting the boundaries after phase measurement in the next step.

In Step 8.02, the processor 144 measures the phase of the polarization modulation that is reflected in the data, and uses the phase information to adjust or modify the boundaries of the 5,000-point block, if necessary, resulting in an adjusted raw data block. Referring to FIG. 9, which illustrates Step 8.02 in more detail in Step 9.01, the processor takes the four arrays of single-detector signal samples in the block, and computes the standard deviation for each. The array with the largest standard deviation is passed on to the next step for phase analysis, i.e., data from only one detection signal will be used to determine phase offset correction.

In Step 9.02, the processor prepares for phase analysis by computing a single averaged cycle (50 points) from points 1501 to 3500 in the 5,000-point single-detector block, to provide a low-noise sample from which the phase will be determined. The first point in the average cycle is the average of points 1, 51, 101, etc. in the block; the second point is the average of points 2, 52, 102, etc. in the block; and so forth for all 50 averaged points.

In Steps 9.03 and 9.04, the processor finds the phase of the averaged cycle by determining the shift or index offset that maximizes its correlation with a model square wave having two cycles. A model square wave is constructed as an array of 100 values, of which points 1 through 25 and 51 through 75 equal +1, and the other fifty points equal −1. In Step 9.03, the processor computes a common digital cross-correlation function for the averaged cycle and the two-cycle model square wave and in Step 9.04, finds the index of the maximum of this cross-correlation function, as an indication of the relative shift between the two arrays that best aligns them by this correlation measure.

In Step 9.05, the processor interprets this “peak index” (the index of the maximum value of the cross-correlation function) to determine by how many points (if any), and in which direction (earlier or later), the boundaries of the 5,000-point block should be shifted to align the block optimally with the averaging pattern. The phase measured in the previous step indicates the nominal positions in the block of all of the positive-going transitions of the selected detection signal; and this, in turn, indicates the nominal positions in the block of all of the transitions (without regard to direction, i.e. positive-going or negative-going). The exact way that the numerical peak-index value indicates these locations depends on normal choices to be made during programming.

In Step 9.06, the processor shifts the boundaries of the block to align the data optimally with the averaging pattern, comprising the desired sets of samples 5.13 in each half cycle. Many different shifts would qualify to produce optimal alignment, i.e. if a shift by some number n_(p) of points qualifies, then a shift by n_(p)+25 or n_(p)−25 points (half of a cycle either way) will also qualify. The qualifying shift with the smallest magnitude is chosen, whether it shifts the block boundaries earlier or later in the data. For the embodiment of FIG. 1, the shifted boundaries will place the nominal position of a transition at the start of the block, because the averaging pattern is fixed relative to the block and chosen to make that condition optimal for alignment.

Having determined the amount and direction of the phase offset correction using the data for one detection signal, the processor 144 applies the same phase offset correction to the data arrays of all four detection signals so that they are properly registered and ready for the averaging and reduction process of Step 8.03.

4.12 (ii) Averaging and Reduction

Referring again to FIG. 8, for each of the digital detection signals D1-D4, in Step 8.03 the processor reduces the 50 samples in each cycle of the block to 2 points, one for each half cycle, each comprising four values that are the average detection signals of the four detectors, the result of using the averaging pattern that was described earlier with reference to FIG. 7 (see also 5.13 in FIG. 5). Thus, referring to FIGS. 5, 6 and 7, the processor 144 discards samples adjacent each of the transitions, i.e., rising or falling edges, and averages at least some of the remaining samples. As shown in trace D of FIG. 6 and all traces but trace (d) in FIG. 7, in this specific embodiment, the processor 144 ignores a prescribed number of samples (two in these examples) that are substantially coincident with a transition, i.e., where the detection signal magnitude is changing rapidly, and averages the remaining samples in each half-cycle, i.e., where the detection signal magnitude remains relatively constant, to produce one average value per half-cycle for each detector.

In Step 8.04, the processor simply stores all of the averaged detection signals from the block, in the circular buffer 4.22 that is designated for averaged detection signals (FIG. 4) enabling use of the data in Process 4.13 to compute fiber change information.

4.13 Computation of Fiber Change Information

The first step in process 4.13 is to determine, from the sets of four detection signals during each half cycle, the Stokes vectors for the output polarization states in each cycle. The entire Stokes 4-vector may be determined but only the three directional components (excluding the total power component) may be needed; so in the following description only three are computed and used. Conditions placed on the analyzer states in this embodiment (as described later) allow each of the four Stokes components of the light that exits the fiber to be computed as a weighted sum of the four detection signals. In other words, the Stokes 4-vector is S=WP, where P is a column vector of the powers of the four detection signals, and W is a 4×4 matrix of the weights. The desired unit Stokes 3-vector s is then computed in the usual way from three of the Stokes parameters (three components of S, ignoring the power component S₀). Computation of the Stokes 3-vector, determination of the weighting matrix W and more-complex algebraic computations for calibration where embodiments have fewer than four analyzer states, will be described hereinafter.

The change-information computation procedure (4.13) will now be described in more detail with reference to FIG. 10 which shows seven steps. Details of the steps performed by processor 144, and the equations used, are as follows:

Step 10.01: The processor reads two average sample values of each of the four detection signals, one for each half cycle, from the buffer; thus, in each cycle there will be eight sample values, the first four corresponding to the first half of a cycle of polarization modulation, and the second four corresponding to the second half cycle.

Step 10.02: For each of the two half cycles, the processor 144 derives a column vector P from the four averaged detection signals and the weighting matrix W used in the formula S=WP, described above, to compute the three Stokes parameters S₁, S₂ and S₃.

Step 10.03: For each of the sets (S₁, S₂, S₃) of three Stokes parameters computed in the previous step, the common normalizing formula

$\left( {s_{1},s_{2},s_{3}} \right) = \frac{\left( {S_{1},S_{2},S_{3}} \right)}{\sqrt{\left( {S_{1}^{2} + S_{2}^{2} + S_{3}^{2}} \right)}}$ is employed to compute the three components of the corresponding unit Stokes 3-vector. This yields the unit Stokes vectors s_(a) and s_(b) for the “a” and “b” halves of the polarization modulation cycle. (see 5.16, FIG. 5)

Step 10.04: To facilitate a particularly convenient method of using two pairs of Stokes vectors to compute the rotation matrix that transforms one pair into the other, the processor constructs a “state matrix” T. Thus, the processor first uses s_(a) and s_(b) to define a third unit Stokes vector from their cross product, s_(c)=s_(a)×s_(b)/|s_(a)×s_(b)|, then constructs T by using these three vectors as columns:

$T = {\begin{bmatrix} s_{a\; 1} & s_{b\; 1} & s_{c\; 1} \\ s_{a\; 2} & s_{b\; 2} & s_{c\; 2} \\ s_{a\; 3} & s_{b\; 3} & s_{c\; 3} \end{bmatrix}.}$ This state matrix T contains the same information as the two Stokes vectors s_(a) and s_(b), but is an example that provides a convenient method of computation in the next step.

Step 10.05: The change matrix M for the current cycle (see 5.17, FIG. 5), representing the change in the fiber polarization transformation since the time of the previous cycle, is computed by multiplying the present state matrix T (on the right) by the inverse of the previous state matrix T₀, so that M=T T₀ ⁻¹.

Step 10.06: The information in the change matrix M is converted into a step vector c (see 5.15, FIG. 5) using formulae based on equations 18 and 19 of section 3-2 in Rotations, Quaternions and Double Groups by Simon Altman, Dover, 2005. These formulae are equivalent to computing c as

${c = {\frac{\theta}{\sin\;\theta}\mu}},$ where the three components of the vector μ are computed as

${\mu_{x} = \frac{M_{32} - M_{23}}{2}},{\mu_{y} = \frac{M_{13} - M_{31}}{2}},{\mu_{z} = \frac{M_{21} - M_{12}}{2}},$ and the angle θ is computed as θ=cos⁻¹[½(M₁₁+M₂₂+M₃₃−1)]. (When θ is small, if the processor 144 has difficulty evaluating the singular expression θ/sin θ, a second-order approximation 1+θ²/6 may be used for θ/sin θ.)

Step 10.07: The step vector c(t₀,t₁) computed for the change between times t_(o) and t₁ is added to the change-space position vector C(t₀) for time t₀ (see 5.19, FIG. 5) to construct the new change-space position vector C(t₁) for time t₁. This new vector C(t₁) is appended to the sequence in the buffer for change information 4.23, and the processor returns to step 10.01 to read the next two average sample values of each of the four detection signals.

The foregoing method of computing the Stokes vectors is suitable for the embodiment of FIG. 1, with four different analyzer states. It may not be suitable for other embodiments that may have fewer analyzer states. But the subsequent steps (after determination of the Stokes vectors) are examples that can be applied to sequences of Stokes vector pairs that have been determined by other embodiments employing different hardware and/or other computational methods.

Depending on the processor and programming choices, it is possible that one or more components of the change-space vector C may grow large enough over time to cause a numerical overflow problem or loss of resolution. If that is a danger, Step 10.07 may also include a check on the magnitude of the change-space vector's components. If some prescribed threshold is exceeded, the processor can correct the excess by subtracting one vector (such as the last vector in the buffer) from all the vectors in the buffer, restoring them all to a region closer to the origin in change space, without changing their relative positions.

This operation must be done in a way that is coordinated with subsequent processes 4.15 and 4.16, so that they are prevented from computing a measure from any “mixed” set of data read partly before the correction operation and partly after, and therefore having some incorrect relative positions.

As described hereinbefore with reference to FIG. 4, the change-space-distance computation process 4.14 (Measure-1) and the spectral power computation process 4.15 (Measure-2) determine two different measures of disturbances from the stored Change Information 4.23. In a separate Decisions and Reporting process 4.16, the measures are evaluated with reference to preselected criteria to determine whether a reportable disturbance has occurred. These three processes 4.14, 4.15 and 4.16 will now be described in more detail with reference to FIGS. 11, 13 and 15.

FIG. 11 illustrates the change-space distance computation process 4.14 for deriving Measure-1 and its history. This process depends on a user-defined time interval t_(D), and a user-defined threshold that represents the maximum amount of net rotation (e.g. in radians) on the Poincaré sphere over any time interval of length t_(D) that will be tolerated without reporting a disturbance. The time t_(D) might, for example, be set to 10 seconds, and the threshold to 0.2 radians. Then, if the fiber's change, over any 10-second period, exceeds 0.2 radian of rotation, a disturbance will be reported. The process of comparing these thresholds will be described by FIG. 15. The steps of FIG. 11 are as follows:

Step 11.01: Before proceeding, the processor ensures that data for a required interval up to time t_(D) exists in the buffer containing change information 4.23. This needs to be invoked only at startup of the system.

Step 11.02: The processor retrieves two change-space position vectors C(t-t_(D)) and C(t) from the buffer containing change information 4.23. One is for the present time t, and the other is for a time t_(D) ago, which is the time t-t_(D).

Step 11.03: The processor computes the distance between these two vector positions as the norm of the vector created by their difference. This is a reasonable estimate of the net angle of rotation in Stokes space that characterizes the change in the fiber transformation between the two times, provided the angle is not too large and the change-space motion is not made up of too many steps. Even if it is not a perfect estimate of the angle, it is a good measure for using with reporting criteria.

Step 11.04: The processor stores the computed distance in the buffer for Measure 1 Histrory (4.24).

Step 11.05: The processor waits for a user-set time period, and then repeats the test. The user-set time period will generally be a small or moderate fraction of the time t_(D) over which the distance is measured, so that successive time intervals overlap and the system is not likely to miss a reportable disturbance just because of timing. For example, the wait time might be 0.3 second.

FIGS. 12A, 12B and 12C are graphs showing the x, y and z coordinates, respectively, of the change-space position vector over 30 seconds, while FIG. 12D is a graph of the distance moved in change space in the preceding three seconds, for different times

The spectral power computation process 4.15 for Measure-2 also operates on blocks of data that have a fixed length equivalent to a time interval of 0.1 second, over which 100 change-space position vectors are acquired. Its “integrated power” computation depends on a user-defined set of weights for different frequencies, selected on the basis of the relative importance of different frequencies in detecting disturbances that merit reporting. The manner in which such selection will made will be known to those skilled in the art. The process 4.15 for computing Measure-2 and its history will now be described in more detail with reference to FIG. 13. Thus:

Step 13.01: The processor reads 100 change-space position vectors from the buffer containing change information 4.23, beginning at a location in the circular buffer indicated by a pointer conveniently called the “change-space pointer”. It may be necessary to wait until at least 100 vectors have been created. The vectors comprise three one-dimensional arrays, each having 100 points and representing change-space position along one of three mutually perpendicular axes x, y and z.

Step 13.02: The processor computes a power spectrum for each of the three arrays. It does so by first applying a window function, specifically a common Hanning window, to each array, to limit the spectral noise and anomalies that can arise when the end of the array does not match the beginning well. After applying the window function to each array, the processor transforms the resulting data with a Fourier transform, discards the redundant second half of the resulting array points, and squares the absolute values of the complex components to create the power spectrum array. These operations are known to those skilled in the art and described in many books such as The Fast Fourier Transform and its Applications, by E. Oran Brigham, Prentice-Hall, 1988.

Step 13.03: The processor adds the three power spectra to create a single array. This is an “isotropic” power spectrum in that an arbitrary rotation of all the change-space vectors before the computation (e.g. by applying the same 3×3 rotation matrix to every vector) will not change its value for any frequency. The d.c. or “zero-frequency” component is discarded as it will not be used in the subsequent analysis.

Step 13.04: The processor creates an “integrated power” value as a weighted sum of the components of the isotropic power spectrum array. The weighting is predetermined by the user. In practice, the processor may be programmed to allow the user to select (or not select) an option of frequency weighting, which causes each component to be multiplied by a number that is proportional to the frequency that it represents, then select an upper frequency and a lower frequency, which causes all of the components corresponding to frequencies below the lower value or above the upper value to be multiplied by zero (i.e. set to zero). After this weighting (multiplication), all the components of the resulting array are added. As is common, the result can be scaled, if desired, for example by dividing by the sum of the weights and/or by multiplication by a factor that controls the units of the resulting value.

Step 13.05: The processor stores the resulting integrated power value in a circular buffer with the capacity for holding all of the computed values over a recent time interval with a duration, for example 10 seconds, that has been preset by the user. The reason for this is that the next step will decide whether to report a disturbance, not just on the current value of the integrated power, but on properties of its recent history.

FIGS. 14A, 14B, 14C and 14D show, respectively, the power spectra for the three components/directions x, y and z and the isotropic power spectrum obtained by combining them.

Step 13.06: The processor records the number of such integrated power values in the recent history buffer which exceed a threshold value that has been preset by the user, then determines whether the number is greater than a preset number also set by the user. If the number is greater than the preset number, the processor reports a disturbance, i.e., outputs an alarm via output/alarm unit 110.

Step 13.07: The processor advances the change-space pointer by 50 samples, and returns to Step 13.01 and repeats the process. By advancing 50 points instead of 100, this step ensures that the next set of 100 change-space vectors, read in step 13.01 and subsequently processed, will overlap the set before it by 50 percent.

This overlap provides a simple method of ensuring that changes in relatively short time intervals can influence the integrated power value. The Hanning window used in Step 13.02 severely reduces the signal for times near either end of each set of 100 points. However, in the event that a brief disturbance happens near the end of one interval, so that it is suppressed, the overlap ensures that it will also be represented near the middle of another interval, where it is not suppressed. (It will be appreciated that, if desired, a different window function could be used instead of the Hanning window.)

FIG. 15 illustrates the decisions and reporting process 4.16 for evaluating Measure 1 History data 4.24 and Measure 2 History data 4.25 and determining whether or not to report a disturbance. This process depends on user-defined thresholds as previously described for FIG. 11 and FIG. 13. The steps of FIG. 15 are as follows:

Step 15.01: The processor retrieves from the buffer containing Measure-1 History the most recent computed distance d.

Step 15.02: The distance, d, is then compared to the user set threshold. If d exceeds the threshold a disturbance is reported.

Step 15.03: The processor retrieves from the buffer containing Measure-2 History a user defined number of recent values. Each one of these values is compared to a user set threshold. The fraction of values that exceed the threshold is computed.

Step 15.04: The computed fraction is compared to a user set threshold. If it exceeds the threshold a disturbance is reported.

Generally, in each of the embodiments, the Change Information 4.23 will take a similar form, being substantially independent of launch and analyzer states yet dependent upon fiber transformation. Accordingly, the Change Information can be processed using the above-described frequency-domain and time-domain processing procedures for determining reportable disturbances

In order for the calibration matrix W to exist and work properly, it is necessary for the four Stokes 4-vectors that characterize the four analyzer states to be linearly-independent. This can be proven by noting that the equation P=W⁻¹S defines the four detector powers as a linear sum of the Stokes 4-vector components, and the matrix designated as W⁻¹ necessarily exists (even if it is not invertible and its inverse, W, does not exist). Because the P components are just powers or the S₀ terms of the Stokes 4-vectors for light passing through an analyzer, each row of W⁻¹ is simply the top row of the Mueller matrix that defines the polarization transformation done by the corresponding analyzer (134, 136, 138,140). But these rows are identical to the Stokes 4-vectors that describe the analyzers. If they are not linearly independent, the matrix W⁻¹ is singular and has no inverse. But if they are linearly independent, then the inverse W exists, i.e. there is a calibration matrix. This requirement on the analyzer states is broad, simple and easy to meet without overt requirements on precisely what the states are (e.g. linear, circular, linear at 45 degrees as in other methods).

A system with four analyzer states characterized by Stokes 4-vectors that are linearly independent will automatically meet the simpler requirement that there should be at least two analyzer states that are polarized and are characterized by Stokes 3-vectors that are linearly-independent. It should be noted that a Stokes 3-vector can characterize an analyzer state so long as it is a polarized state, even though the Stokes 3-vector does not include the extinction-ratio information that a Stokes 4-vector characterization would include. One of the four analyzer states can be completely unpolarized (thus having no associated 3-vector), but not two.

One method for constructing the calibration matrix W is implied by the proof of its existence that appears above. The matrix W⁻¹ is simply constructed so that each row is the top row of the Mueller matrix that defines the polarization transformation done by the corresponding analyzer, or so that each row is the Stokes 4-vector that characterizes the corresponding analyzer state. Then its inverse is computed to get W. Alternatively, the matrix W can be calibrated by methods such as described by Mikhailov et. al, in “Robust Remote Calibration of Fiber Polarimeters,” in Optical Fiber Communication Conference, OSA Technical Digest (CD) (Optical Society of America, 2011), paper OWC4.

In order to achieve a good signal-to-noise ratio, it is valuable for the four Stokes 4-vectors representing the analyzer states to be, in some sense, “robustly” linearly independent. If the 4-vectors should be too close to a condition of dependence, then, even though the matrix W⁻¹ is well-behaved, its inverse W might be nearly singular, in a way that causes the four weights (one row of W) used for computing at least one of the Stokes vector components to include large positive and negative numbers. The weighted sum could then include a small difference between large multiples of two or more detection signals, and would be very sensitive to small noise-induced changes in the measured detection-signal values. The Stokes 4-vectors for the analyzer states should therefore be far enough from a condition of linear dependence to support a signal-to-noise ratio that is sufficient for the particular application.

Similar requirements apply to the linear independence requirement on the Stokes 3-vectors characterizing the two launch states of polarization. These two states should also be sufficiently polarized to support a good signal-to-noise ratio. As an example, a commercial four-analyzer product from General Photonics called PolaDetect™ has four analyzer states of polarization, all with fairly high extinction, that may be described as approximately x-polarized, y-polarized, 45-degree polarized and circularly polarized. These states are not perfect or perfectly aligned or accurate, nor do they need to be. The product comes with a calibration matrix provided by the vendor. These states have Stokes 4-vectors that are quite different, and may be called robustly linearly independent.

It will be appreciated that the invention is not limited to the above-described embodiment described with reference to FIG. 1, with its two launch states, four analyzer states, and preferred way of processing the detection signal data to determine measures of fiber disturbances that are substantially independent of the launch states and analyzer states of polarization. A number of other embodiments having different combinations of launch states and analyzer states will now be described with reference to FIGS. 16 to 20. These embodiments also will use the detection signal data to compute Stokes vectors, but the methods for computing these Stokes vectors will be different. The subsequent processing, to derive measures of change that are substantially independent of the launch states and analyzer states and to determine whether the changes are reportable, may be the same. Thereafter, modifications to the hardware and processing procedures will be described, some of which apply to specific hardware embodiments and others of which will be applicable to each of the embodiments regardless of the specific launch and analyzer states employed.

The following descriptions of embodiments 2 to 5 use some symbols and notations that differ from those used in the description of embodiment 1; however, their correspondence with those used for the first embodiment will be apparent from the context.

Embodiment 2 Two Launch States; Two Analyzer States Plus an Unpolarized Detector

A second embodiment, which uses two launch polarization states and two polarized detection states plus a polarization-insensitive detector, will be described with reference to FIG. 16. It illustrates the advantage of reduced fading, and the advantage of liberty in selecting the polarization states that are launched into the fiber and selected by the detectors.

In FIG. 16, a launch unit 100 and a monitoring unit 102 are shown coupled to proximal and distal ends, respectively, of an optical fiber 104 which might be from a few meters long to more than 50 km. Light in two different states of polarization (SOP), launched into the proximal end of fiber 104 by launch unit 100, travels through the fiber 104 and is received at the distal end by the monitoring unit 102 and analyzed for changes consistent with a physical disturbance of the fiber 104 indicative of an intrusion attempt somewhere along its length. Such an intrusion attempt might be movement of the fiber 104 by someone preparing to tap it or another fiber in the same cable, and to extract data signals propagating therealong, e.g., where the fiber is used for communications purposes, or by a person, an intruder, attempting to access a secure area around which the fiber 104 is deployed, perhaps along a perimeter fence or buried in the ground and defining the perimeter. For convenience, the following description will refer, where appropriate, to the data communications application. In such an application, the fiber 104 might be carrying data by means not shown in the figure, such as by light at a different wavelength using wavelength-division multiplexing, or it might be a sensing fiber that is in the same cable structure as another fiber that is carrying data.

The launch unit 100 comprises a linearly polarized laser source 112, a polarization controller 114 and a timing control unit 116. The light from the laser 112 is passed through the polarization controller 114 which, under the control of a 1 kHz timing or clock signal from timing control unit 116, alternately rotates the state of polarization of the light by two different amounts to produce two different linear polarization states. The light in these two states is assumed in this embodiment to be essentially 100% polarized. The timing control unit 116 switches the polarization controller 114 with a 50% duty cycle between the two polarization states, cycling once every millisecond. The polarization controller 114 comprises a device such as the General Photonics PolaSwitch™ product, which accepts linearly polarized input light and switches its output between two linearly polarized states at 45 degrees to each other under control of an external electrical signal.

In each 1 millisecond cycle, the timing unit 116 momentarily increases the output power of the laser 112 to provide a synchronizing pulse to “inform” the monitoring unit 102 about the phase of the polarization modulation in the associated one-millisecond period. The timing control unit 116 positions the synchronization pulse at or near a transition between the two states, for example on the rising edge, so as not to waste stable moments that are valuable for signal measurement and averaging. Thus, the launch unit 100 modulates the polarization state at 1 kHz, while at the same time embedding a timing sync pulse.

Referring again to FIG. 16, the detection unit 106 comprises an input lens 118 which receives the light from the distal (downstream) end of the fiber 104, two beam splitters 120 and 122, respectively, three detectors 126, 128 and 130, respectively, and two polarizers 136 and 138, respectively. The beam splitters 120 and 122 are disposed in the light path in succession, each transmitting a portion of the incoming light and reflecting a portion. In the case of beam splitter 120, the reflected light is conveyed directly to detector 126, whereas the reflected light from beam splitter 122 is conveyed to detector 128 via polarizer 136. Finally, the light transmitted by beam splitter 122 is conveyed via polarizer 138 to third detector 130. The arrangement is such that the incoming light is split and applied to the two polarizers 136 and 138 and the detector 126 in approximately equal amounts. The polarizers 136 and 138 are assumed in this embodiment to have essentially perfect extinction. Each detector 126, 128 or 130 is designed to produce an electrical output that is proportional to the optical power that it receives. Typically, each will comprise a photodiode and a transimpedance amplifier, as is commonly used for light detection.

The electrical outputs from detectors 126, 128 and 130 are each supplied to a processing unit, 108, which includes a digitizer unit 142 which comprises three corresponding analog-to-digital converters (ADCs, not shown) for converting the electrical outputs to corresponding digital signals which are supplied to a processor unit 144, which may comprise a microprocessor and/or digital signal processor. The first detector 126 measures unpolarized power and its output is used by the processor 144 for normalizing the outputs of the other two detectors.

The electrical signal from (unpolarized) detector 126 also is supplied to synchronizer unit 146 that uses a phase-locked loop to generate a clock signal that is synchronized to the 1-kHz periodic timing by using the synchronization pulses applied to the light signal in the launch unit 100. The frequency of this ADC clock signal may be 50 kHz as in the first embodiment but could be different, if desired. With that in mind, the following description will use a frequency of 10 kHz.

The synchronizer unit 146 comprises a bandpass filter which filters out unwanted noise before applying the filtered signal to a phase-locked loop (PLL) circuit, which locks onto the periodic pulses generated and generates the ADC clock signal, which is supplied to digitizer unit 142 to clock the separate analog-to-digital converters which convert the signals from detectors 126, 128 and 130, respectively. The corresponding converted digital signals are supplied to the processor 144.

In operation, the electrical signal from each of the detectors 126, 128 and 130 is sampled at a sampling rate ten times the rate at which the polarization state is switched. Thus, during each 1-millisecond cycle of the signal as detected in the detection unit 106, the processor 144 will sample the amplitude of the power of the light incident upon each of the three detectors 126, 128 and 130 five times for each half cycle, i.e., for each polarization state. The output of the bandpass filter located in the synchronizer 146 will yield a train of pulses that are synchronized to the 1 kHz modulation of the incoming light, this train of pulses being used by the synchronizers internal PLL to drive the digitizer unit 142 with the same phase of the signals from the two launch states.

For each of the detectors 126, 128 and 130, the processor 144 will measure and average amplitude measurements for only four of the five sample intervals for each launch state, ignoring the first sample in each half of the polarization modulation cycle which coincides substantially with a state transition, and in one case with the synchronizing pulse.).

The processor 144 will normalize power values of the detection signals from the two “polarized” detectors 128 and 130 by dividing them by the power of the signal measured by the “unpolarized” detector 126 and multiplying each of the resulting polarized detector signals by an appropriate calibration factor to create a “normalized” power value, such that a maximum value of 1.0 will occur for any of the detectors if the polarization of the incoming light is exactly aligned with the transmission axis of the polarizer preceding that detector. This normalization avoids variations caused by laser power drift or the fiber loss, in known manner.

Once these normalized power values have been obtained, the processor 144 processes them to determine the changes of state in the fiber 104 over time, before using this information to assess whether or not a disturbance signifying a reportable intrusion attempt has occurred, or to deduce other information.

To determine information about the fiber's change that is independent of the launched and detected polarization states that are used to probe the fiber, we will describe a process of first determining how the Stokes vectors for the two output states have moved during the change, and then deducing how the fiber has changed.

Assume now that s is the Stokes vector for a beam of 100% polarized light, and a detector that receives that light beam is preceded by a polarizing filter that passes the polarization represented by a Stokes vector d and rejects the orthogonal state represented by −d. Let the normalized signal seen by the detector be m. The relationship between these variables is m=½(1+d·s)  (e3)

Here the dot symbol represents a standard scalar product of vectors. This follows from equation [3.11] of the paper, “PMD fundamentals: polarization mode dispersion in optical fibers,” by J. P. Gordon and H. Kogelnik, Proc. Natl. Acad. Sci. USA, Vol. 97, No. 9, pp. 4541-50 (2000), the contents of which are incorporated herein by reference.

For descriptions of how to process the data, we will generalize equation (e3) using the following notation. Let m_(ij) represent the normalized power on polarized detector i when polarization state j is launched into the fiber. Thus, i and j can each range from 1 to 2, and m₂₁ is the normalized power measured on the second detector when the first launch state is applied. Let d_(i) be the Stokes vector that represents the polarization state associated with the i^(th) polarized detector, and let s_(i) be the unit Stokes vector that represents the fiber's output polarization state when the i^(th) launch state is launched into it. Then equation (e3) implies that, during each 1-millsecond cycle of the polarization states, the four normalized power values that the system measures are related to the output Stokes vectors and the detection Stokes vectors by the following equations: d ₁ ·s ₁=2m ₁₁−1  (e4) d ₁ ·s ₂=2m ₁₂−1  (e5) d ₂ ·s ₁=2m ₂₁−1  (e6) d ₂ ·s ₂=2m ₂₂−1  (e7)

Here we have rearranged equation (e3) to put the scalar product on the left. The four m values are measured, and we assume that the two d vectors are known, so the only unknown quantities are the two unit s vectors. Since these are unit vectors, each vector comprises only two unknown variables rather than three: knowing two of the components will tell us the magnitude of the third vector component, because the sum of the squares of all three components is 1. Determination of the sign of the third component can be done with help from one additional requirement, that the scalar product between the Stokes vectors representing the two output states is constant: s ₁ ·s ₂ =K _(S)  (e8)

Here the constant K_(s) is calibration information that can be pre-determined for the system, then used for analysis of the data.

For a more explicit explanation of how this defines the x and y components and the z sign relationships, and of some uncertainty that remains for this sign, let us define the x direction in Stokes space as the direction parallel to the vector d₁ for the first detection state. Let us define the z direction as the direction of the vector cross product d₁×d₂, and then let us define the y direction as the direction of the vector cross product of a unit vector in the z direction with a unit vector in the x direction. (Here it is assume that the two detection Stokes vectors are neither parallel nor anti-parallel, i.e. they are linearly independent, so that their cross product creates a non-zero z vector.) With axes thus defined, d₁ and d₂ both lie in the x,y plane. Then, it is clear that equations (e4) through (e7) comprise four equations in the four unknowns that are the x and y components of S₁ and s₂. They say nothing about the z components of these two vectors, because the scalar products with d₁ and d₂ annihilate those components. They are linear equations in the four unknowns, and can be solved by standard matrix determinant formulae that are found in many textbooks, or by elementary vector algebra. In fact, the four equations comprise two independent pairs: equations (e4) and (e6) comprise one pair that can be solved without reference to the other two, and equations (e5) and (e7) comprise the other pair. A vector solution for the components of S₁ that lie in the x,y plane (or the plane of d₁ and d₂) is (s _(1x) ,s _(1y),0)=c ₁ d ₁ +c ₂ d ₂  (e9) where the coefficients c₁ and c₂ are

$\begin{matrix} {{c_{1} = \frac{{2\; m_{11}} - 1 - {K_{d}\left( {{2\; m_{21}} - 1} \right)}}{1 - K_{d}^{2}}},{c_{2} = \frac{{2\; m_{21}} - 1 - {K_{d}\left( {{2\; m_{11}} - 1} \right)}}{1 - K_{d}^{2}}},} & \left( {e\; 10} \right) \end{matrix}$

and where the constant K_(d) used to define them is the detection Stokes vector scalar product, K _(d) =d ₁ ·d ₂  (e11) Similar equations give the components s_(2x) and s_(2y) of s₂ that lie in the x,y plane.

While equations (e4) through (e7) say nothing about the z components of the two output Stokes vectors, adding the requirement that they are unit vectors will tell us the magnitudes of these components: |s _(1z)|=√{square root over (1−s _(1x) ² −s _(1y) ²)},|s _(2z)|=√{square root over (1−s _(2x) ² −s _(2y) ²)}  (e12)

The signs of these components are not determined, but the product of their signs generally is determined by using equation (e8), because if both z-magnitudes are non-zero, then changing the sign of one without changing the other will change the value of the scalar product. Changing the signs of both will not affect equation (e8), i.e. this equation cannot discriminate between two mirror images of the pair of Stokes vectors. However, for many applications (including most security applications) this simply does not matter. What would matter is if the computational method caused one or two large z components to change sign suddenly, giving the false appearance of a sudden large change in the fiber. This can be prevented whenever new output Stokes vectors S₁ and S₁ are computed with equations (e4) through (e8) by applying a simple continuity rule: every time z-components are computed for a new successive short time interval, the signs are chosen to minimize the rotation angle corresponding to the change. (Instructions for computing the rotation angle are given further below.) Alternatively, we can use a similar rule that the signs of the two new z components are chosen, compatibly with equation (e8), in a way that minimizes the largest z-component change for the two vectors. (In the event of equality, we can chose to preserve the sign of the z component of s₁.) With either rule, the Stokes vectors are fully determined, and the effect of inadvertent reflections of the two computed Stokes vectors (reversals of both z components) will be minimized. An erroneous reflection may happen due to noise in the detector signals, but it will be when both z components are near zero so that the change is not large, and this will be harmless in most applications.

Having determined the two output Stokes vectors before and after the change, we can now finish computing change properties of the fiber that are independent of the launch and detection states, by finding the rotation that moves both initial output Stokes vectors to their new positions on the Poincaré sphere. A procedure for this was shown above, in connection with equations (e3) and (e4). The resulting matrix M is the change matrix discussed previously, and the polarization transformation that it represents does not depend on the particular launch and detection states that were used to measure it, but it depends only on the change in the fiber. (As noted before, the actual numbers in the matrix depend on whatever basis has been chosen for Stokes space, and that has been chosen, in this example, with the x, y and z axes having a particular convenient relationship to the detection states. But that does not comprise dependence for our purposes, and we could change to any other arbitrary basis without changing the essential properties of a set of matrices M that are measured in this way.)

The computational methods described here allow the change matrix to be computed between any two fiber transformations. As with most measurements, noise or small errors in the raw data (the measured detector signals) may cause errors in the result. In this particular embodiment of the invention, there are conditions under which the effects of electrical or optical noise may be enhanced to a level that is unacceptable in some sensing applications where very small changes need to be detected. It may be valuable, therefore, to determine when such a condition occurs, and compensate for it. In our chosen coordinate system, the Stokes vectors for the two detection states have no z components. Suppose the output vector s₁ from the fiber lies in the x,y plane at some moment, i.e. it also has no z component; and suppose the fiber is disturbed in a way that changes s₁ in the z direction so that, to first order in a small rotation, there is no change in its x and y components. (That will be the case for rotation of the sphere about any axis that lies in the x,y plane.) This will create, to first order, no change in the values of m₁₁ or m₂₁ in equations (e4) and (e6), because the scalar products with d₁ and d₂ are insensitive to changes in the z component of s₁. If s₂ also lies in the x,y plane, such a rotation will also have no first-order effect on the other two measured signals, m₁₂ and m₂₂. Stated differently, the derivatives of all four measured signals with respect to the rotation angle are very small, when both output Stokes vectors and the rotation axis are all close to the x,y plane. Because we are inverting this problem to compute the motion of the output Stokes vectors from the changes in the measured detector signals, this means that small changes in the measured signals (e.g., due to noise) will make large changes in the computed z-motion of the computed Stokes vectors.

The condition of enhanced noise sensitivity described here is somewhat infrequent because it requires a coincidence (both output Stokes vectors are nearly in the x,y plane), and it is easy to remedy for applications where the resulting noise is too high, e.g. where it would create false alarms in a security system. In a preferred embodiment, the signal processing simply detects when the output Stokes vectors are close to the x,y plane, and the method of processing is altered to prevent false alarms due to noise, in any of a number of ways. A preferred method of altering the processing is to compute the changes in the z components of the Stokes vectors, but then attenuate these changes before passing them on to the next stage of processing, i.e. before computing the change matrix for the time interval in question. Also in a preferred method, the attenuation of the z-component change is done to a degree that depends on magnitudes of the two z-components of the Stokes vectors before and after the change. This takes advantage of the fact that the noise sensitivity decreases as the z-components of the Stokes vectors become more distant from zero.

The dependence of the attenuation on the z-component value is chosen on the basis of the actual noise on the detectors and the alarm processor settings, in a way that is designed to attenuate sufficiently to prevent false alarms but not to attenuate significantly more than is necessary. In this example of an embodiment we calibrate by first measuring the magnitudes of the Stokes vectors continuously (once every millisecond) while twisting and bending the fiber slowly, to make the Stokes vectors wander over the sphere. For each ½ second of data, we then compute both the average magnitude of the z-component of each Stokes vector, and the noise, defined as the root-mean-square deviation of this magnitude from its mean. These pairs of numbers are all plotted on a graph, and from the graph we estimate the “baseline noise” as the average noise when the magnitude of the z-component is 0.2. Then we choose a set of smaller z-component values for which the noise is higher, and estimate a factor for each value that will reduce its associated noise to the baseline value. These factors are then incorporated into the processor's memory as a look-up table, and used to attenuate during operation.

Attenuation of each change for a Stokes vector is done by taking the actual measured vectors (with any noise-induced errors) before and after a change, s=(s_(x), s_(y), s_(z)) and s′=(s′_(x), s′_(y), s′_(z)) respectively, then creating an altered second vector with reduced changes, defined as s′_(a)=(s′_(x), s′_(y), s_(z)+f[s′_(z)−s_(z)]). Here f is the attenuation factor assigned, by interpolation with the look-up table, to the z-component value that is the average of s_(z) and s′_(z). (This operation on s′ to produce s′_(a) will change the magnitude slightly, so s′_(a) should be re-scaled to make it a unit vector before using it, by dividing the vector by its magnitude.) This process is done for each of the two Stokes vectors to create s₁′_(a) and s₂′_(a). Then the altered set of vectors s₁, s₂, s₁′_(a) and s₂′_(a) is passed to the routine that computes the change matrix M, instead of the actual set s₁, s₂, s₁′ and s₂′.

Embodiment 3 Two Launch States; Two Analyzer States

A third embodiment of the hardware is illustrated in FIG. 17. It is similar to the embodiment described above in connection with FIG. 16, except for three specific differences with advantages that can reduce the cost in particular applications, and that can mostly be implemented independently of each other in different combinations to suit a particular application.

A first difference is that the synchronizer 146 that tracks the polarization modulation frequency and phase is eliminated, and replaced by a direct connection of the timing control unit 116 to the digitizer block 142. This is especially suitable for applications where the fiber 104 is short, or where it is long but looped back, so that the launch unit 100 and the monitoring unit 102 are close together and a direct connection is easy. Also, the timing control unit 116 no longer causes periodic higher-intensity pulses of the laser 112 to act as synchronizing markers, and the laser 112 is simply held at steady power.

A second difference, not explicitly apparent from FIG. 17, is that the light launched into the fiber 104 may have a significant unpolarized component (it is not necessarily 100% polarized), and the polarizers 136 and 138 may have significantly imperfect extinction ratios. In this case, the degree of polarization of the light, and the extinction ratios, are assumed to be stable, and to be known. A third difference is that the polarization-insensitive detector 126 in FIG. 16 is eliminated, along with its associated beamsplitter 120 and the associated analog-to-digital converter in the digitizer block 142. In this case, the optical power is assumed to be known by other means, such as a prior calibration combined with a laser unit 112 that is known to be stable.

For this embodiment, equation (e3) will be modified to reflect the partial polarization and extinction. We can derive the modified equation in a standard way by creating a Mueller matrix for the detector with partial extinction, and applying it to the four Stokes components that describe partially polarized light. However, another simple approach is presented here. A partially polarized light beam behaves like an incoherent sum of two beams, one unpolarized and another fully polarized, with the power divided between them in a way that depends on the degree of polarization. Also, a filter with imperfect extinction passes an amount of light from a beam (whether polarized, unpolarized or partially polarized) that equals the sum of what would go through two filters, one unpolarized and the other polarized with perfect extinction. The two hypothetical filters include attenuation values that depend on the extinction ratio of the real filter that they mimic. Thus the total power P_(t) that is detected can be written as a sum of four powers: P _(t) =P _(uu) +P _(up) +P _(pu) +P _(pp).  (e13)

Here the subscripts u and p represent unpolarized and polarized respectively, and the first subscript refers to which of the two hypothetical detectors sees this power, while the second refers to which light beam is being detected. The first three terms on the right of equation (e13) are all constant, independent of the relative orientations of the polarized parts of the light beam and the detector filter. For example, P_(up), the power detected by the hypothetical unpolarized detector from the polarized component of the light, will not vary as the fiber is disturbed. Only the last term P_(pp), representing detection of the polarized part of the light with the polarized part of the detector, varies as the fiber's transformation varies, and it behaves exactly like equation (e3), except for a scale factor that accounts for the fraction of the light that is polarized and the attenuation of the hypothetical polarized detector. As a result, the fraction of m of the total power in the beam that is detected by a particular detector depends on the output unit Stokes vector s of the polarized part of the light in the following way:

$\begin{matrix} {{m = {{\frac{\alpha}{2}\left( {1 + {d \cdot s}} \right)} + \beta}},} & \left( {e\; 14} \right) \end{matrix}$

Where α and β are numbers that can be determined by twisting and bending the fiber (or otherwise changing s without altering the optical power) to find both the maximum of m (equal to α+β, when s is parallel to d) and the minimum (equal to β, when s is anti-parallel to d); or by straightforward computations from measurements of the extinction ratio and loss of the particular detector and the degree of polarization of the light. The equivalents of equations (e4) through (e7) therefore have the form

$\begin{matrix} {{d \cdot s} = {{2\frac{m - \beta}{\alpha}} - 1.}} & \left( {e\; 15} \right) \end{matrix}$

As before, this results in two pairs of linear equations, each pair determining the x and y components of one of the two output Stokes vectors. They can be solved for s₁ and s₂ in the same way as before, except that the number on the right is computed in a different manner from the measured signals. If the total power is not correctly known but is consistently “wrong” by some factor, the computed value of m may not correctly represent the fraction of the real optical power that is detected, but the calibration of α and β by determining the maximum and minimum m values will still give a correct result. That is evident from equation (e15), where scaling m, α and β by the same factor will make no change. What matters is that m, as computed by some means from the measured detector signals, is proportional to the fraction that is detected.

As with the first hardware embodiment, once the output Stokes vectors s₁ and s₂ are determined both before and after a change, they can be used to determine the change matrix M by the method previously described.

Embodiment 4 Two Launch States; Three Analyzer States

A fourth preferred embodiment of the hardware is illustrated in FIG. 18. It is similar to the hardware embodiment described above in connection with FIG. 17, except for specific differences described below that have performance advantages that suit particular applications. For high-sensitivity applications, the addition of a third polarized detector has the advantage of preventing the enhanced noise sensitivity that is found in the two previously described second and third embodiments, making mitigation methods such as attenuation of the z-component of change unnecessary, and eliminating the residual fading that accompanies such mitigation.

In comparison to FIG. 17, a first difference of FIG. 18 is that a third detector 126 has been added with a beamsplitter 120 for deflecting part of the light to this detector, and with a polarizer 134 for polarizing the light. As in the hardware embodiment of FIG. 17, all three polarizers 126, 128 and 130 can have poor extinction. There are three analog-to-digital converters in the digitizer unit, one for each detector.

A second difference is that the direct connection from the timing control unit 116 to the Digitizers 142 is eliminated, and the timing control unit does not feed synchronous clock information to the digitzer unit to tell it when to convert. Instead, there is a digitizing clock 146 that triggers conversions asynchronously at a faster frequency of 100 conversions per cycle of the polarization modulation, or 100 kHz. The digitizing clock 146 and the timing control unit 116 that generates the polarization modulation frequency are not synchronized or locked together, so their relative phase will drift. Stated differently, the digitizing frequency is nominally an integer multiple of the polarization modulation frequency, so that the points in time when the signals are digitized will be very close to the same for each successive cycle; but this is not exactly true, so the pattern of digitizing times will drift slowly relative to the polarization modulation waveform. To accommodate this drift, the processor discerns the phase, and adjusts its processing accordingly, as described below. Furthermore, the output of the timing control unit 116 that controls the polarization modulation is not a symmetric square wave. Instead, it causes the polarization control unit to dwell 55% of the time on the first polarization state, and 45% of the time on the second state. (This description will continue to refer to the two “half cycles” even though one is a little more than half, and the other is a little less.) The reason for this asymmetry is to enable the processor to determine which half cycle is for which launch state.

Both the timing control unit 116 and the digitizing clock unit 142 are controlled by crystal oscillators so that the 100:1 ratio of the digitizing frequency to the polarization modulation frequency is accurate to one part in 20,000. This implies that for each successive 0.1-second block of data (100 polarization modulation cycles and 10,000 digitizing events), the position of the phase of digitizing will drift at most by one half of a digitizing period relative to the polarization modulation waveform. The signal for each half cycle is found by averaging a number of points, only now there are more points and some asymmetry, so that 45 points are averaged during the first (longer) half cycle, and 35 points are averaged for the second (shorter). Ten points are discarded during each transition. (The choices of these numbers depend on the transition speed of the polarization control unit 114.) Before computing these averages for the two half cycles, the processor determines the phase to use for the entire block, by examining a the signal from a selected one of the three detectors over a section of the block of 10,000 points, comprising 1,000 points, or 10 polarization modulation cycles, in the middle. The selection process and the examination method are described in the next two paragraphs below.

For selecting which of the three detector signals to examine for phase, the processor computes the mean-square deviation of each signal from its average, over the central 1,000 points that will be examined, and it selects the signal for which this deviation is the largest. At least one of the three will have a non-zero amplitude of modulation due to the polarization switching. This results from “substantially linearly independent” requirements that we will impose on the three detection states and on the two launch states. Those requirements and the results are explained further below, after presentation of the mathematics for data reduction.

For examining the central 1,000 points from the selected detector for phase, we first average all ten successive cycles of data to produce a single 100-point cycle with unknown phase and reduced noise. Then we modify this 100-point waveform by subtracting its mean. Then we digitally convolve this with a synthesized model waveform that is two cycles long, comprising 55 points of value +1, 45 of −1, 55 of +1, and 45 of −1, to produce a correlation function of the measured waveform with an ideal waveform. Then we find the index, in the resulting correlation function, where the absolute value of the correlation function is a maximum. This tells us, by standard techniques, where the boundaries of the two half cycles are, i.e. it tells us the desired phase. The details of the implementation will of course vary, depending on the programmer.

In the example of this embodiment, each block of data that is processed is meant to be “aligned” so that its start is very close to a transition to the longer (55%) first phase from the shorter (45%) second phase of polarization modulation. This ensures that the first several points and the last several points in the 10,000-point block are near the transitions and therefore will be ignored. Therefore, the phase of processing (the choice of which points to integrate in each half cycle) for each block can be shifted by one or two points based on the measured phase of that block, with no need to add points to either end of the block. Besides its use for selecting which points to integrate, the measured phase for each block will be used to properly align the phase of the next block of data by shifting it relative to the data stream, either by collecting more than 10,000 new points and discarding one or more points from the beginning, or by collecting fewer than 10,000 new points and adding one or more points to the beginning from the end of the previous block. This continual phase measurement and re-alignment is how the system tracks the phase of the data. The number of points by which each block is shifted (according to the measured phase of the preceding block) will generally be zero, +1 or −1. An exception is that the first block, seen when data collection begins, may have a large phase shift so that a large number of extra new points will need to be added to the end of the second block to bring it into alignment; and so that this first block has only 99 complete cycles of non-transition data, plus fractional cycles at the beginning and the end. The first block will therefore be used only for phase measurement, and other computations will be done only for blocks after the first. These other computations comprise the determination of 100 sets of six detector signal values (one set for each 1-millisecond cycle, each set comprising three average detector signals for the first polarization state and three for the second), subsequent determination of output Stokes vectors and change matrices, and subsequent processing for alarms, measurements or other assessments.

For signal processing to determine the changes in the two output Stokes vectors s₁ and S₂, the processor will use equations similar to (e14) or (e15) to indicate the information about a Stokes vector that is derived from the signal on a detector. However, we will include a separate parameter that is related to the power of the laser, in order to show how that can be determined from the three polarized-detector signals, without using a polarization-insensitive detector as in the second embodiment, or a separate means for knowing it such as calibration as in the third embodiment. Equation (e15) indicates a linear relationship between the scalar product of the output and detection Stokes vectors and the measured signal on the detector. Instead of the symbol m, which we have defined as the fraction of the light that is detected, let us use a symbol P_(i) to designate the power on the i^(th) detector in some units. Then equation (e15) can be expressed in an equivalent manner as

$\begin{matrix} {{d_{i} \cdot s} = {{{\frac{2}{\alpha_{i}}m_{i}} - \left( {\frac{2\beta_{i}}{\alpha_{i}} + 1} \right)} = {\frac{2\; P_{i}}{\alpha_{i}P_{0}} - {\gamma_{i}.}}}} & \left( {e\; 16} \right) \end{matrix}$ Here γ_(i) is defined in terms of α_(i) and β_(i) as the expression in parentheses. We will consider α_(i) and γ_(i) to be the calibration parameters (instead of α_(i) and β_(i)). We have expressed the detected fraction of the light as m_(i)=P_(i)/P₀, where P₀ is a measure of the total power that is proportional to the actual power and is consistent with P_(i) and the means for calibrating α_(i) and γ_(i). The calibration parameters α_(i) and γ_(i) depend on the individual detector and on which output state is being measured. They are determined in a way that makes the expression in this equation range from −1 to +1 as s is rotated through all positions on the Poincaré sphere.

It is convenient to rewrite equation (e16) for all three detectors in a matrix and vector notation: Ds=2ap−g.  (e17)

Here D is a matrix in which each row comprises the three components of one of the detection Stokes vectors, and its product with the column vector s creates a column vector in which each element is one of the scalar products of equation (e16). The parameter a is the reciprocal of the total power measure P₀ in the light beam (not just the polarized part represented by the Stokes vector s), and we will call it the “power factor.” Each element of the column vector p is not P_(i), but a term P_(i)/α_(i) that can be called “corrected power”, and each element of the column vector g is one of the terms γ_(i) that appears in (e16).

We assume that the three detection Stokes vectors are linearly independent. This implies that the three rows of D are linearly independent and therefore, as is commonly known, the matrix is non-singular and has an inverse D⁻¹ that can be found by well-known methods. The Stokes vector s can therefore be expressed as s=aq−r,  (e18) where for convenience we have defined new vectors q and r as q=2D ⁻¹ p,r=D ⁻¹ g  (e19) These are vectors in Stokes space, mapped by D⁻¹ from the sets p and q of quantities derived from detector signals and calibration parameters. Note that q depends on measurements of the light, and varies as the fiber is disturbed; but r depends only on the detection states, combined with information about extinction of the detection filters and the degree of polarization of the light.

Because D⁻¹ and g depend on calibration information that we assume is known, and the corrected power information in p comprises a combination of measured signals with known calibration information, equations (e18) and (e19) define the Stokes vector s precisely if the power factor a is known. But this embodiment is meant to address cases where the power may drift and we need to track it, or where it is unknown when a system begins operation, so a must be determined. Information about a is provided by the fact that s is a unit vector. Therefore, 1=s·s=|q| ² a ²−2[q·r]a+|r| ².  (e20) This has the common form Aa ² +Ba+C=0,  (e21) where A=|q| ² ,B=−2q·r,C=|r| ²−1  (e22) Equation (e21) has two possible “roots” or solutions for a, which we will denote as a₊ and a⁻ and which can be expressed as

$\begin{matrix} {{a_{+} = \frac{{- B} + \sqrt{B^{2} - {4\;{AC}}}}{2\; A}},{a_{-} = \frac{{- B} - \sqrt{B^{2} - {4\;{AC}}}}{2\; A}},} & \left( {e\; 23} \right) \end{matrix}$

We need to select between these two roots to have a solution, but before proceeding with that subject we will make some brief mathematical observations. With elementary algebra, it is possible to solve (e18) for q and substitute this into (e22) and (e23) to derive an expression for the “wrong” solution a_(w) in terms of the correct a-value and the actual Stokes vector s:

$\begin{matrix} {a_{w} = {\frac{a}{1 + {\frac{2}{C}\left( {1 + {r \cdot s}} \right)}}.}} & \left( {e\; 24} \right) \end{matrix}$

Whether this wrong solution a_(w) is a₊ or a⁻ depends on the sign of the (1+r·s) term in the denominator. A quantity related to this term is the “discriminant” Q, defined as

$\begin{matrix} {Q = {{B^{2} - {4\;{AC}}} = {\frac{4}{a^{2}}{\left( {1 + {r \cdot s}} \right)^{2}.}}}} & \left( {e\; 25} \right) \end{matrix}$

The two roots are equal when the discriminant is zero. The rightmost expression is the square of a real number, implying that the discriminant is never negative, so that both of the roots (e23) are real. It is also noteworthy that the discriminant can attain a value of zero for any system that allows the unit output Stokes vector s to vary over its full range. This follows from the fact that a unit vector s can always be found that makes r·s equal −1 in (e25), which results from the fact that the magnitude of the vector r is greater than unity. To see this, note from the definition (e19) that g=Dr. Since each row of the matrix D is one of the three detection Stokes vectors, this means that each component γ_(i) of g is the scalar product of r with the unit vector d_(i), which can be written as γ_(i) =d _(i) ·r=|d _(i) ∥r|cos θ_(i),  (e26) where θ_(i) is the angle between the two vectors. The cosine must be less than 1 for at least one index i, because we are assuming that the vectors d_(i) are linearly independent; and the magnitude of d_(i) is 1. Therefore the magnitude of r must be greater than γ_(i). But each γ_(i) is greater than or equal to 1, from its definition in (e16) and the fact that α and β are both non-negative. Therefore the magnitude of r is greater than 1.

For selection between the two solutions, we will assume that the light intensity is steady and changes significantly more slowly than the fiber's transformation. This implies that the wrong solution in (e24) will change in value as s changes, while the correct solution will be nearly constant. That provides a selection criterion. In this embodiment, we continually compute the “recent average” of the selected a value over a recent time period. This time period is adjustable and will typically be on the order of 1 to 10 seconds for a security system. Each time that a Stokes vector is determined, we pick the solution or a₊ or a⁻ that is most nearly equal to the recent average value of a.

This embodiment has a selection enhancement method that overrides the simple selection rule described above, in particular cases. The enhancement provides means to initialize the recent average value when the system begins operation, and to recover from an error that may have latched the system to the wrong solution for the power factor a. Such an error can occur, for example, when Q in equation (e25) is close to zero so that the correct and wrong solutions, related by (e24), are approximately equal. Suppose now there is a significant drift in the laser intensity, and simultaneously there is a drift in the fiber that causes the wrong solution a_(w) of equation (e24) to stay nearly constant, while the correct solution drifts. Although this is unlikely (and good control of the laser intensity will make it more unlikely), the algorithm described above could cause the wrong solution to be tracked to a point in time where the two solutions are significantly different, changing the effective sensitivity of the system until the condition can be discovered by the signal-processing software and corrected. That possibility is a motivation for the selection enhancement method described below.

Until now we have discussed using the detector signals to determine the Stokes vector for a single output state. But there are two output states, and the selection enhancement method will take advantage of this. The two output states can be distinguished notationally by adding a subscript 1 or 2 to the vectors s, s′, p, g, r and q, and to terms such as A, B, Q, C and a. The vectors D and d_(i) are properties of the detectors that do not depend on which output state is being measured.

A first aspect of the selection enhancement method is the use of the power ratio of the two output states. Depending on the polarization control unit 18, the two powers may be identical, but we can at least make it so that the ratio is quite constant. This ratio is measured and recorded as a power-ratio calibration parameter K_(p), defined as

$\begin{matrix} {{K_{p} = \frac{a_{1}}{a_{2}}},} & \left( {e\; 27} \right) \end{matrix}$ where a₁ and a₂ are the power parameters a for the two output states. Each time we compute the Stokes vectors, we compute this ratio for all four combinations of possible solutions:

$\begin{matrix} {{K_{p\; 2} = \frac{a_{1 -}}{a_{2 +}}},{K_{p\; 3} = \frac{a_{1 +}}{a_{2 -}}},{K_{p\; 4} = {\frac{a_{1 -}}{a_{2 -}}.}}} & \left( {e\; 28} \right) \end{matrix}$

Here a₁₊ means the solution a₊ computed from equation (e23) for the first output state, a₂₊ means the solution a₊ for the second output state, and so forth. These four “trial K_(p) values” K_(p1) through K_(p4) will be used for selection as described below.

A second aspect of the selection enhancement method is the use of the angle between the Stokes vectors of the two output states, or equivalently the use of its cosine, which is the scalar product of the two vectors. We expect this to remain constant. The scalar product is measured and recorded as angle calibration information K_(a), and is related to the Stokes vectors by the equation K _(a) =s ₁ ·s ₂.  (e29) We define s₁₊ as the Stokes vector s computed from equation (e18) using a₁₊ and the measured information q₁ for the first output state, and so forth. Each time we compute the Stokes vectors, we compute this scalar product for all four combinations of possible solutions: K _(a1) =s ₁₊ ·s ₂₊ ,K _(a2) =s ¹⁻ ·s ₂₊ ,K _(a3) =s ₁₊ ·s ²⁻ ,K _(a4) =s ¹⁻ ·s ²⁻.  (e30) These four “trial K_(a) values” K_(a1) through K_(a4) will be used for selection as described below.

A third aspect of the selection enhancement method is that, at times determined by a pre-established rule, we record the set of four trial K_(p) values and four trial K_(a) values for some number n of measurements in a row, and each time we compare all n sets of values to the corresponding calibration values. If the values for a particular combination of solutions gives better agreement with the calibration values for all n measurements than the current selected combination, then we switch to using that combination as the selected combination. If, as a result, we have switched the solution for the first Stokes vector, then we force the recent average value for a₁ to be the new value. The same applies to a₂, if its solution has been switched. In the example of the present embodiment, we compute and record the different measured K_(p) and K_(a) values for the last ten cycles of each block of 100 cycles of the polarization modulation. Then, a decision to switch the selected combination of solutions, if it is made, is applied to the signal processing for the next block of 100 cycles. Also in this example, assessments of changes in the fiber's polarization transformation are made independently for each block of 100 cycles, ignoring any apparent sudden change at the boundaries of the blocks. Because of this, a sudden jump in the computed Stokes vectors, caused by switching the choice of solutions at the block boundary, will not cause a false alarm.

A fourth aspect of the selection enhancement method is that the processor reports the values of the measured power ratio and the scalar product to an operator of the system, on demand. This enables the operator to observe whether the system has drifted significantly from its calibration.

When the system first begins measurements, it does not report them as valid data until ten blocks of data have been processed. This gives the selection enhancement method time to find solutions for the power parameters a₁ and a₂ that are consistent with the calibration information for the power ratio and the scalar product between the Stokes vectors. The time period for the recent averages of a₁ and a₂ cannot be the full specified period initially, so averaging is done only over the shorter history from the beginning to the present, until the beginning becomes more distant than the specified averaging period such as 1 to 10 seconds. The same kind of shorter history is used temporarily when we modify the recent average upon switching the selected solution for one of the power parameters, as described above.

In summary, the procedure for computing the output Stokes vectors, for each cycle of the polarization modulation, is to measure the power values P_(i) for each detector for the first output Stokes vector during the appropriate half cycle; compute P_(i)/α_(i) from the calibration information α_(i) for each detector; assemble these ratios into the column vector p; operate with D⁻¹ to produce q according to equation (e19); evaluate A, B and C according to (e22); compute both of the solutions in equation (e23); compute the two possible Stokes vectors s from equation (e18); repeat all of this for the second output Stokes vector; and select the solutions according to the selection methods described above. These are elementary computations that enable us to install a monitoring system on a fiber of unknown loss without the need to calibrate the power levels, and to recover automatically in case an unusual event causes the system to lose track of which solutions are correct for the power parameters.

As with the first hardware embodiment, once the output Stokes vectors S₁ and s₂ are determined both before and after a change, they can be used to determine the change matrix M by the method previously described.

The three Stokes vectors for the three detection states are required to be linearly independent for reasons discussed above, and for some additional reasons described below. Some requirements for the launch states are also discussed below.

As described above, this embodiment has the ability to adjust to the optical power level and to track drift in the laser power, using only three polarized detectors made with inexpensive, imperfect components, without the need for a detector that is precisely polarization independent. Also, for high-sensitivity applications, it is immune to the occasional enhanced noise sensitivity that is found in embodiments 2 and 3. This can be seen by rearranging and differentiating equation (e17), to get an expression for the differential changes dp in the three detector signals, in terms of the differential vector change ds in a measured Stokes vector:

$\begin{matrix} {{dp} = {\frac{1}{2\; a}D\;{{ds}.}}} & \left( {e\; 31} \right) \end{matrix}$

Here the matrix D is non-singular because the three detection Stokes vectors that comprise its rows are assumed to be linearly independent. Therefore, there is no non-zero differential s-vector movement ds that is annihilated by D, i.e. that will cause zero for the set dp of first-order changes in all three detection signals. In other words, the condition of enhanced noise sensitivity of embodiment 2 does not occur. To make this robust, we want D to be in some sense substantially non-singular, or the three detection Stokes vectors to be substantially linearly independent. An example of such a requirement is described in the next paragraph, where it is also applied to the phase detection algorithm.

The phase detection algorithm described above relies on an assumption that at least one of the three detectors will always have a measurable square-wave signal due to the polarization modulation, as does our ability to determine the change matrix without encountering periods of enhance noise sensitivity. If the optical power and degree of polarization are the same for the two output polarization states, this requirement is equivalent to requiring that the three detection Stokes vectors should be linearly independent. This follows from equation (e31). Although this was written as a differential expression applied to a change in just one of the two output Stokes vectors, it is valid for large changes because equation (e17) is linear; and we can interpret ds in (e31) to be the change between the two output polarization states during a single cycle, ds=s₂−s₁. (This is not a small differential quantity.) Linear independence of the three Stokes vectors (from which D is constructed) implies that at least one component of dp is non-zero for any non-zero change ds in the Stokes vectors. The change ds always has the same large magnitude, but can vary in orientation. If the three detection Stokes vectors should all lie nearly in one plane through the origin (though not exactly, since they would then be linearly dependent), then dp could still be quite small when ds is nearly orthogonal to that plane. For this reason, we want assurance that the three detection Stokes vectors are not nearly co-planar. This requirement can be stated by requiring that the angle between any detection Stokes vector and the normal to the plane defined by the other two (the direction of the “worst-case” ds) should be no more than some angle. If that limiting angle is 60 degrees, then when the signal change for two of the detectors is zero, the change for the remaining detector will be no less than half of what it would be if the angle were the ideal value of zero, because the cosine of 60 degrees (proportional to the scalar product) is 0.5. On these grounds, the angle between any detection Stokes vector and the normal to the plane defined by the other two should be no more than 60 degrees. Equivalently, the angle between any detection Stokes vector and the plane defined by the other two should be no less than 30 degrees.

If the optical power and/or degree of polarization is different for the two polarization states, a detector will see modulation due to a combination of intensity changes and changes in the scalar product of its Stokes vector with the Stokes vector for the change. The two could cancel under the right circumstances, and this makes analysis more intricate. For detector i and output Stokes vector s₁, equation (e16) can be written as

$\begin{matrix} {{P_{i\; 1} = {\frac{\alpha_{i\; 1}}{2}\left( {\gamma_{i\; 1} + {d_{i} \cdot s_{1}}} \right)P_{01}}},} & \left( {e\; 32} \right) \end{matrix}$ where P₀₁ is the total power in the first output state. From this, the change in the measured power on the i^(th) detector when we switch states is P _(i2) −P _(i1)=½(α_(i2)γ_(i2) P ₀₂−α_(i1)γ_(i1) P ₀₁)+½(α_(i2) P ₀₂ s ₂−α_(i1) P ₀₁ s ₁)·d _(i).  (e33) This can be re-written more simply as ΔP _(i)=δ_(i)+½w _(i) ·d _(i),  (e34) where the new parameters are defined implicitly in the obvious way. Each vector w_(i) can be viewed as a weighted difference between the two output Stokes vectors, with weights that depend on the two powers, the two degrees of polarization, and the extinction ratio for the i^(th) detector.

This change in the detector signal is more complex than before. Rather than stating limits on different relationships between parameters that describe power, extinction and degree of polarization, we will simply note that the states, the extinction of the filters and the degrees of polarization for the two input states must be such that at least one of the three power changes (for i=1, 2 or 3) will be substantially non-zero for any rigid rotation of the vectors s₁ and s₂ together, i.e. for any rotation of all three w_(i) vectors. We can take “substantially non-zero” to mean more than 20% of the maximum signal that the detector can experience. Even though it involves many parameters, equation (e33) or (e34) is easy to evaluate by a person of normal skill, and the process of testing a set of parameters for acceptability is a straightforward problem. After determining all three δ_(i) and w_(i) for a particular set of parameters, it is straightforward to find the rotations that make (e34) zero for two detectors, and evaluate the expression for the third to see whether it meets the requirement of being substantially non-zero.

Embodiment 5 Three Launch States; Two Analyzer States

A fifth embodiment using a modified hardware configuration is illustrated in FIG. 19. It is similar to the embodiment described above in connection with FIG. 18, except for specific differences described below that have advantages in particular applications. In particular, this design achieves similar advantages over embodiments 2 and 3, but it achieves this with two polarized detectors instead of three, by using three launch states that are linearly independent and are created with a single variable retarder.

In comparison to the embodiment of FIG. 18, the chief difference of the embodiment shown in FIG. 19 is that detector 126 is omitted, along with its associated beamsplitter 120 and polarizer 134, and its associated A-to-D converter in the digitizer block 142. Another difference, not explicitly apparent in the figures, is that the polarization control unit 114, under command from the timing control unit 116, successively creates three different launch polarization states instead of two in each polarization modulation cycle. These three states are required to have Stokes vector representations that are linearly independent.

Details of the polarization control unit 114 are shown in FIG. 20. Timing information enters the polarization control unit 114 via port 71, which comprises two digital signal lines that are encoded to represent three different states to a three-state voltage generator 72. For each of the three states (i.e., for each phase of the polarization modulation cycle) the voltage generator 72 feeds a different voltage to the variable circular retarder 73, so that it creates three different amounts of circular retardance. To enable the detection system to discriminate between the three states, their time durations are all made different. For this embodiment, the durations of the three phases are approximately 30%, 33% and 37% of the modulation cycle, respectively.

Linearly polarized light (from the laser 112 of FIG. 19) enters the optical port 75 and passes through a tilted linear retarder 76, which converts the light to an elliptical polarization state before it enters the variable circular retarder 73. The variable circular retarder 73 rotates the Stokes vector about the polar axis of the Poincaré sphere by a variable amount, transforming its input light to different elliptically polarized states. It is constructed in a manner similar to the General Photonics PolaSwitch™ product, except that three different input voltages cause rotations of approximately 0, 120 and 240 degrees on the Poincaré sphere relative to the first state (instead of the Stokes-space rotation of approximately 90 degrees indicated for the third embodiment). The linear retarder 76 is designed to place the Stokes vector of the light entering the variable circular retarder 73 on a line of latitude with an angle that approximates an ideal value of 35.26 degrees above the equator on the Poincaré sphere. These conditions will make each of the three launch polarization Stokes vectors have angles of approximately 90 degrees with respect to the other two, making them strongly linearly independent, and providing a better signal-to-noise ratio than if the three Stokes vectors came closer to lying in a single plane.

The data from this embodiment can be analyzed with effectively the same equations that were applied to the fourth embodiment, except that the meanings of the parameters and variables have changed. This similarity follows from the reciprocity theorem, which states that the optical coupling between a source at a first location and a detector at a second location is the same as the coupling between a source at the second location and a detector at the first location. The reciprocity theorem is well known, and is discussed theoretically, for example, by Larry Di Girolamo, “Reciprocity principle applicable to reflected radiance measurements and the searchlight problem,” Applied Optics Vol. 38, No. 15, P. 3196 (1999); and experimentally, for example, by Min-Joong Kim, “Verification of the reciprocity theorem,” Applied Optics Vol. 27, No. 13, p. 2645 (1988). The brief discussion below summarizes the mathematics for the fourth embodiment, and then indicates how equations (e16) through (e34) should be reinterpreted for this fifth embodiment.

Equation (e15) is a general formula that relates the “coupling” m (proportional to the fraction of the total power that is detected), for a single launch state and a single detection state, to the scalar product d·s of the fixed detection-state Stokes vector d with the output Stokes vector s, which varies as the fiber is twisted or bent or otherwise disturbed. These unit Stokes vectors describe the polarized parts of the detection filter and the transformed launch beam respectively, for cases where the extinction of the detection filter may be imperfect and the degree of polarization of the launched light may be less than 100%. The equation assumes that changes in the fiber do not alter the degree of polarization. The two calibration parameters α and β are defined in terms of the maximum and minimum values that m can attain, as discussed before, and they can be determined by directly measuring the range of m, or by computations from knowledge of the extinction and degree of polarization combined with other information such as detector sensitivity. These two parameters generally depend on which detector and which launch state the equation is applied to. Equation (e16) is the same as (e15) except that a subscript i distinguishes which detector; and it implicitly defines γ_(i) in terms of α_(i) and β_(i). This is for a single launch state, and the dependence on the launch state is understood.

For this fifth embodiment, equation (e16) still applies, but we adapt it by taking d_(i) to mean the Stokes vector for the i^(th) one of the three launch states (instead of detection states), and s to mean the Stokes vector resulting from a particular one of the two detection states (instead of launch states), after it is projected or transformed by the fiber back to the input end of the fiber (instead of forward to the output end). Each m, is a measured number proportional to the power on a particular detector for the i^(th) launch state, and the parameters α, and β_(i) or α_(i) and γ_(i) are defined in the same way as before, to make the expression in equation (e16) range from −1 to +1 for each combination of a detector with a launch state.

We adapt equation (e17) by constructing the matrix D and vector g in exactly the same way from the three d_(i) vectors and the γ_(i) parameters. For each back-projected detection state, equation (e18) then defines the Stokes vector s as before, in terms of the vectors defined by (e19) and the power parameter a. The power parameter a for each projected Stokes s vector is then found by selecting one of the two roots indicated in (e23), according to rules described after equation (e26).

After the two Stokes vectors are found both before and after a change, the change matrix M is determined by the method previously described, and further conversion and/or reduction of the data can then be done for assessment of the disturbance to the fiber. Unlike the matrix determined in the first three embodiments, this matrix does not represent the change in the fiber's “forward” transformation from the input end to the output end. Instead it represents the change in its “reverse” transformation from the output end to the input end. That difference is insignificant in most applications of this sensing method. It is as if the two polarized detectors (detector 30 with polarizer 34 and detector 32 with polarizer 36) actually represented differently polarized light sources, and the three launch states actually represented polarized detectors. A difference is that the two imitation launch states (actually detection states) are sampled simultaneously instead of sequentially, and the three imitation detection states (actually launch states) are sampled sequentially instead of simultaneously.

In the example of this embodiment, as in the fourth embodiment, each of the two detector signals is digitized 100 times in each 1-millisecond cycle of the polarization modulation. The three phases nominally last 30, 33 and 37 digitized points. For the i^(th) launch state (i=1, 2 or 3), the three m, values in each cycle will be computed by averaging about 20, 23 and 27 points respectively during the stable parts of the cycle, ignoring 10 points in the vicinity of each transition of the modulated polarization state. It is necessary to determine the phase of the polarization modulation cycle relative to the digitizing process, in order to decide which points to average. As with embodiment 4, we will determine the phase for a 10,000-point (0.1-second) block of data by processing a central section of 1,000 points or 10 polarization modulation cycles. But we now have two detector signals, each with three-phase modulation (instead of three signals with two-phase modulation) so the analysis method is different. First we subtract the two signal arrays to create a difference array. Next we digitally differentiate this array, and from this we create a new smoothed array, in which each point is the average of about five contiguous points in the original differentiated array. Then we create a final data array that is the absolute value of this smoothed array. This will be convolved with a model array that is constructed by starting with an array of 100 zeros (indexed 1 to 100 in the notation used here), and setting the values to 1 at points 29, 30, 31, 62, 63, 64, 99, 100 and 1. This array is then appended to itself to make a 200-point model array, and this is convolved with the final data array. The index of the point where the convolution or the correlation function is a maximum will provide information about the phase of the data. The details of the implementation will of course vary, depending on the programmer. The phase information is used in the same way as for embodiment 4 to determine which points to average to get the three m, values for each launch state in each cycle of the polarization modulation.

The foregoing descriptions of several embodiments illustrate how different combinations of launch states and analyzer states can be used to produce Change Information that can be used to derive measures and make decisions using much the same signal processing steps 4.13, 4.14 and 4.15. It is also envisaged that the signal processing might be modified too. For example, as an alternative to Measure-1 and Measure-2, the Decisions and Reporting process 4.16 may evaluate the Change Information for geometrical or dimensional characteristics of the motion.

The examples of measures can be computed in multiple ways that may look different but are nevertheless the same, or that are the same except for small differences due to the use of different approximations. Different measures can be devised that embody the same principles to make them substantially free of fading, and these are not necessarily confined to those that can be easily identified as dependent on frequency-domain or time-domain analysis. Some examples are given below.

Any particular one of the measures can be computed from the detection signals in a number of different ways. Different possible ways of computing these same measures may produce slightly different values as a result of the computational methods or of mathematical approximations that are used, but for disturbances characterized by small rotation angles of the Poincaré sphere they are the same to first order in the scale of the angles. For example, the three Euler angles of a small rotation can be taken as components of a vector to create the step vector. For the purposes of this disclosure, the measures produced by such alternative methods are considered the same, although one or another may be preferred in a particular application for computational efficiency or for the quality of its approximations. Also, computations that differ in inconsequential ways, such as a scale factor or simple functional dependence, are considered the same.

As a simple example of different computational methods achieving the same end, note that the isotropic power spectrum obtained by Fourier analysis of a block of change-space position vectors could also be found by similar analysis of a block of step vectors. That is because the step vectors are like time derivatives of the position vectors, so the Fourier transform of a block of position vectors can be found by transforming a corresponding block of step vectors and dividing each point in the resulting complex array by its associated frequency. As another example, note that, for some block of change-space position data, the integral of the isotropic power spectrum of over some narrow band can be found by taking the block of change-space position vectors and applying a suitable narrow-band digital filter, then computing the mean value of the squared vector magnitude of the resulting filtered vector sequence.

While the change-space path process of computing Measure-1 is a convenient way to represent the change information and provides an easy means for computing the net angular rotation over different time intervals, this same measure can also be computed by methods that do not involve change space. For example, the rotation angle for a large time interval can be computed from the Stokes vectors at the start and at the end without reference to change space (e.g., by using them to computed a change matrix for the large time interval, then finding the rotation angle implied by the matrix from the formulae used previously to determine the step vector). While the examples of embodiments store the change information 4.23 in the form of change-space position vectors for subsequent operation on blocks of data by measure-computing procedures such as 4.14 and 4.15, change information can be stored in other ways. For example, a sequence of Stokes-vector pairs [s_(a)(t), s_(a)(t)] for different times t can be stored and then operated upon to compute measures. Such a sequence of Stokes vector pairs contains the change information, even though it is less compact than a sequence of change-space position vectors or step vectors, and even though it retains additional information (e.g. the angle between the Stokes vectors for the two launch states) that may be eliminated in the final computation. In fact, the raw data could be stored instead of the Stokes vectors, because that, too, contains (when supplemented with calibration or other information) all of the necessary information about the fiber changes (plus more). This would simply add complexity to the measure computations, while removing it from the preparatory processing such as 4.12 and 4.13.

As examples of different measures encompassed by the invention, it is useful to reduce the information in a change-space path by computing various qualitative or quantitative measures that characterize geometric or dimensional motion in change space, because this has implications about the geometric complexity of the physical disturbance to the fiber. It is useful to determine, for example, whether a disturbance chiefly elicits one-dimensional motion in change space, or whether the movements are dominantly in some two-dimensional plane, or spread among all three dimensions; or whether two disturbances are dominantly in the same plane or different planes. These simple questions about the character of motion in change space clearly have answers, and there are different ways to analyze such character that can be devised by one of normal skill in the art. As an example of an analysis method, a covariance matrix can be computed for the three vector components of a set of all of the change-space position vectors C over some moderate time interval. The covariance matrix is then diagonalized to find its eigenvectors and eigenvalues. (This technique is known to those skilled in the art, and is encompassed by standard “principle component analysis.”) If one eigenvalue is much larger than the other two, the motion is predominantly one-dimensional. If two eigenvalues are comparable and much larger than the third, it is two dimensional, and the disturbance that created it must have a two-dimensional nature. The two associated eigenvectors indicate the plane of motion in change space. If two disturbances at nearly the same time have the same plane of motion in change space, there is an increased likelihood that they come from the same place in the fiber. The angle between the planes that characterize two disturbances can be computed as a useful measure, and if it is large there is an increased likelihood that the two disturbances come from different places in the fiber. As another computational method for geometric or dimensional analysis, it is useful to examine the Fourier transforms of the x, y and z components of C(t) before squaring them to produce power spectra, because they contain phase information. If there is a peak or resonance at some frequency, for example, the amplitudes and phases of the three Fourier transforms, evaluated at that frequency, will provide information on both the direction(s) and shape of the oscillation (linear, circular, elliptical, etc.)

In a more detailed example of the use of a geometric measure, the processor determines the change-space position, as prescribed above, once every millisecond, and continuously applies a standard low-pass digital filter to the three stream of position values for the three change space coordinates, storing the filtered change-space position values in a circular buffer with 20 seconds of past data. The filter attenuates information at frequencies higher than 4 Hz. Each 0.1 second, the processor determines the distance between the most recent vector position and the position at a time 10 seconds ago. If this is greater than a threshold set by the user, it examines the changes further by computing the 3×3 covariance matrix for the x, y and z arrays of filtered change-space coordinate values recorded between 10 seconds ago and the present time. Then it determines the largest two eigenvalues for this matrix, and computes the ratio of the larger to the smaller. If this ratio is less than a threshold set by the user (such as 20, indicating that the disturbance has prominent motion in at least two dimensions instead of being chiefly linear), the processor creates notification of a reportable disturbance.

It might be desirable, and possible, to create an equivalent measure m′ that is benignly dependent upon some constant attribute or set of attributes, a, of the analyzer states and/or launch states and is also dependent on a measure m that is substantially independent of the launch and analyzer states, according to a relationship having the form m′=f(m,σ). For example, σ might be the scalar product/products of two or more of the Stokes vectors characterizing the states. The function ƒ might be arbitrarily constructed or might seem to result naturally from some measurement process that would obscure its nature. Such a measure m′ is considered in this specification to be equivalent to the measure m, and in that sense to represent a measure that is substantially independent of the analyzer states and/or launch states.

The Decision and Reporting processes using Measure-1 and/or Measure-2 to determine whether a disturbance should be reported can be done in many different ways, and has been done in many different ways in existing products marketed for monitoring and security. Often a measure is a numerical quantity that is compared to a threshold, and the threshold can be adjusted by the user of a monitoring system to optimize the reporting of disturbances that should be reported, and minimize “false positive” reports or false alarms, i.e. disturbances that are reported but should not be. Several layers of discrimination can be used, such as registering an “event” when a threshold is exceeded, but not reporting an “alarm” unless at least some number of events, such as three, occurs within some time period, such as ten seconds. This can be viewed as using one or several measures to construct new measures. As an example, products are commercially available that offer the ability to sum the components of a power spectrum over a selected frequency range to create a measure, updated regularly at short time intervals, that can be compared to a threshold. But instead of instantaneous comparison, it can be effectively averaged over an adjustable time and the average (which is yet another measure) can be compared to a threshold. Exceeding the threshold creates an event, and the user can set up the system so that an alarm is issued if the number of events occurring in some time interval (which is yet another measure) exceeds some particular number.

Systems and methods embodying the present invention may substantially eliminate fading and, relative to previously-known systems and methods, may liberalize the requirements for the states of polarization that are launched into the fiber, and for the polarization analyzers used for detection. Accordingly, using such a method, the measure(s) of the change in the fiber polarization coupling properties may be accurate and stable independently of the factors that cause the fading or signal variability to which known monitoring systems are susceptible. Moreover, the logical design and manufacturing techniques are simplified because the specifications for polarization components are less narrow. Also, the manner in which embodiments of the present invention process data to substantially eliminate fading may yield improved performance, for example easier setting of alarm thresholds, as compared with the known systems discussed hereinbefore.

In preferred embodiments of the invention, each of the measures of the change is substantially independent of particular polarization states of the light as launched into the fiber or used to analyze the received light, temperature drifts, or static bends or stresses in different regions preceding or following the location of the apparent intrusion.

INDUSTRIAL APPLICABILITY

Although the above-described embodiments of the invention monitor optical fiber to detect disturbance associated with attempts to tap the fiber, the invention encompasses other applications where it is desirable to detect disturbances. One such application is the detection of intruders disturbing the fiber when seeking ingress to a protected zone or building. Another such application is the monitoring of vibration by attaching the optical fiber to a structure to be monitored and detecting disturbance of the fiber by the disturbance. The invention also encompasses the measurement of the magnitude and other characteristics of a disturbance such as the relative acoustic power in different frequency bands by monitoring an optical fiber subjected to such audo power.

Embodiments of the present invention advantageously may provide improved performance as compared with previous methods and systems for monitoring for disturbance of an optical fiber by transmitting polarized light through the fiber, especially in terms of reducing susceptibility to fading problems without increasing manufacturing costs and impairing manufacturability by design constraints.

For these reasons, it would be valuable not to require linear or circular filters with specific relationships, but to allow filters associated with either the launch or detection functions to have much more general elliptical polarization states, with only those constraints on their relationships that are necessary to ensure good measurements, and with suitable calibration techniques so that data reduction is sufficiently accurate. This would allow the monitoring unit to be built using “imperfect” parts thereby improving manufacturability.

The entire contents of each of the patents and technical articles mentioned hereinbefore are incorporated herein by reference.

Although embodiments of the invention have been described and illustrated in detail, it is to be clearly understood that the same are by way of illustration and example only and not to be taken by way of limitation, the scope of the present invention being limited only by the appended claims. 

The invention claimed is:
 1. A method of monitoring a length of single-mode optical fiber for physical disturbance along its length, comprising the steps of: (i) at a first location, launching into the fiber polarized light having at least two different predetermined launch states of polarization (L_(a),L_(b)) whose respective Stokes vectors (s_(La), s_(Lb)) are linearly independent of each other; (ii) at a second location downstream from the first location, receiving the light from the fiber; (iii) analyzing the received light using at least two different analyzer states of polarization that are characterized by respective Stokes vectors that are linearly-independent of each other; (iv) detecting the analyzed light to provide corresponding electrical detection signals and deriving therefrom corresponding digital detection signals; (v) deriving from the digital detection signals one or more representations ([s_(a)(t₁), s_(b)(t₁)], [s_(a)(t₂), s_(b)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) of a change in the polarization coupling properties (R) of the fiber between a first time (t₁) and a second time (t₂); (vi) computing from said one or more representations at least one measure of change in polarization coupling properties of the fiber, said at least one measure being substantially independent of said launch states and said analyzer states; and (vii) determining with reference to preselected criteria of reportable physical disturbances whether or not said at least one measure indicates a reportable physical disturbance along said length of fiber.
 2. A method according to claim 1, wherein said one or more representations comprise a plurality of said representations ([s_(a)(t₁), s_(b)(t₁)], [s_(a)(t₂), s_(b)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) over a predetermined period of time, each successive representation being stored and with preceding representations providing change information comprising a sequence ([s_(a)(t₁), s_(b)(t₁)], [s_(a)(t₂), s_(b)(t₂)], [s_(a)(t₃), s_(b)(t₃)], . . . ; M(t₁,t₂), M(t₂,t₃), . . . ; c(t₁,t₂), c(t₂,t₃), . . . ; C(t₀), C(t₁), C(t₂), . . . ) of change information for the fiber polarization coupling (R) over said predetermined period of time.
 3. A method according to claim 1, wherein said at least one measure and said preselected criteria are based upon the time domain.
 4. A method according to claim 1, wherein said at least one measure and said preselected criteria are based upon the frequency domain.
 5. A method according to claim 1, wherein said at least one measure is based upon either or both of time domain and frequency domain and said preselected criteria are based upon either or both of time domain and frequency domain and said determination is based upon geometric properties of the changes in the polarization coupling.
 6. A method according to claim 1, wherein said at least one measure and said preselected criteria are based upon the frequency domain, and the determination (i) uses Fourier analysis to convert the change information over a prescribed time period into the frequency domain and obtain a power spectrum characterizing the apparent disturbance, (ii) computes measures of the power distribution in different frequency bands, and (iii) compares the measures with values for these measures that are known to characterize reportable and/or unreportable disturbances.
 7. A method according to claim 6, wherein the determination of whether or not a reportable disturbance is indicated is made by analysis of the power spectrum to identify one or more frequencies at which the power level fails to meet specified power-frequency criteria.
 8. A method according to claim 1, wherein said at least one measure and said preselected criteria are both based upon the time domain and the method further comprises computing from the one or more representations at least one further measure based upon the frequency domain, wherein said at least one measure and said preselected criteria are based upon the frequency domain, and the determination (i) uses Fourier analysis to convert the change information over a prescribed time period into the frequency domain and obtain a power spectrum characterizing the apparent disturbance, (ii) computes measures of the power distribution in different frequency bands, and (iii) compares the measures with further preselected criteria comprising values for these measures that are known to characterize reportable and/or unreportable disturbances.
 9. A method according to claim 1, wherein said criteria include preset thresholds representing acceptable changes in the fiber transformation properties and the determination is made according to how many times the threshold is exceed within a specified period of time.
 10. A system for monitoring a length of single-mode optical fiber for physical disturbance at a location or zone (L) along its length, comprises a launch unit and a monitoring unit: the launch unit being adapted for launching into one end of the length of fiber polarized light at a first location light having at least two different predetermined launch states of polarization (L_(a),L_(b)) whose respective Stokes vectors (s_(La), s_(Lb)) are linearly independent of each other; the monitoring unit being adapted for receiving the light from a second end of the fiber and comprising: a polarization analyzer system for analyzing the received light using at least two different analyzer states of polarization that are characterized by respective Stokes vectors that are linearly-independent of each other: an optical detection system for detecting the analyzed light and providing corresponding electrical detection signals; an analog-to-digital convertor system for deriving from the electrical detection signals corresponding digital detection signals; a processor operable for: (i) deriving from the digital detection signals one or more representations ([s_(a)(t₁), s_(b)(t₁)], [s_(a)(t₂), s_(b)(t₂)]; M(t₁,t₂); c(t₁,t₂); C(t₁), C(t₂)) of a change in the polarization coupling properties (R) of the fiber between a first time (t₁) and a second time (t₂); (ii) computing from said one or more representations at least one measure of change in polarization coupling properties of the fiber, said at least one measure being substantially independent of said launch states and said analyzer states; and (iii) determining with reference to preselected criteria of reportable physical disturbances whether or not said measure or measures indicate a reportable physical disturbance.
 11. A system according to claim 10, wherein said one or more representations comprise a plurality of said representations (M(t₁,t₂); c(t₁,t₂); C(t₂)) derived over a predetermined period of time, the processor storing each successive representation with preceding representations, if any, to provide change information comprising a sequence (M(t₁,t₂), M(t₂,t₃), . . . ; c(t₁,t₂), c(t₂,t₃), . . . ; C(t₀), C(t₁), C(t₂), . . . ) of change information for the fiber polarization coupling (R) over said predetermined period of time.
 12. A system according to claim 10, wherein said at least one measure and said preselected criteria are based upon the time domain.
 13. A system according to claim 10, wherein said at least one measure and said preselected criteria are based upon the frequency domain.
 14. A system according to claim 10, wherein said at least one measure is based upon either or both of time domain and frequency domain and said preselected criteria are based upon either or both of time domain and frequency domain and said determination is based upon geometric properties of the changes in the polarization coupling.
 15. A system according to claim 10, wherein said at least one measure and said preselected criteria are based upon the frequency domain, and in making the determination the processor (i) uses Fourier analysis to convert the change information over a prescribed time period into the frequency domain and obtain a power spectrum characterizing the apparent disturbance, (ii) computes measures of the power distribution in different frequency bands, and (iii) compares the measures with values for these measures that are known to characterize reportable and/or unreportable disturbances.
 16. A system according to claim 10, wherein the determination of whether or not a reportable disturbance is indicated is made by analysis of a power spectrum of the change information to identify one or a combination of frequencies for which a spectral power level fails to meet specified power-frequency criteria.
 17. A system according to claim 10, wherein said at least one measure and said preselected criteria are both based upon the time domain and the processor further computes from the one or more representations at least one further measure based upon the frequency domain, wherein said at least one measure and said preselected criteria are based upon the frequency domain, wherein the processor uses Fourier analysis to convert the change information over a prescribed time period into the frequency domain and obtain a power spectrum characterizing the apparent disturbance, computes measures of the power distribution in different frequency bands, and compares the measures with further preselected criteria comprising values for these measures that are known to characterize reportable and/or unreportable disturbances.
 18. A system according to claim 10, wherein said criteria include preset thresholds representing acceptable changes in the fiber transformation properties and the determination is made according to how many times the threshold is exceed within a specified period of time. 